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TRANSGENIC PLANTS CONTAINING ALTERED LEVELS 
OF STEROID COMPOUNDS 

5 TECHNICAL FIELD 

The present invention relates to biotechnology with an 
emphasis on plant biotechnology, and particularly 
biotechnology affecting the biosynthesis of steroid 
compounds . 

10 

BACKGROUND 

Enhancement of the nutritional or health benefits of 
oils through genetic engineering is being addressed 
throughout the agricultural community. Several approaches 

15 involve manipulation of already present cellular 

biosynthetic pathways. Steroid biosynthetic pathways are of 
current interest, particularly for the enhancement of health 
benefits from food oils. 

Several related U,S. patents address increasing sterol 

20 accumulation in higher plants. Those patents include U.S. 

Patent No. 5,589,619 "Process and Composition for increasing 
squalene and sterol accumulation in higher plants" 
(accumulation of squalene in transgenic plants by increasing 
HMGR activity) and U.S. Patent No. 5,306,862 "Method and 

25 composition for increasing sterol accumulation in higher 

plants" (increasing HMGR activity to increase plant sterol 
accumulation- -including sterol and cycloartenol , which 
affects insect resistance- -in tobacco, tomato, corn, carrot, 
soybean, cotton, barley, arabidopsis, guayule and petunia; 

30 seeds with elevated sterol/cycloartenol , 7S promoter and 
CaMV promoters), U.S. Patent No. 5,365,017 "Method and 
composition for increasing sterol accumulation in higher 
plants" (DNA construct with HMGR-CaMV 35S, transgenic 

-1- 
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plants, hybrid plants, com, soy, barley, tomato, 
Ar^idopsis) , U.S. Patent No. 5,349,126 "Process and 
composition for increasing squalene and sterol accumulation 
in higher plants" (increase in squalene and sterol 
5 accumulation by increasing HMGR activity in transgenic 
tobacco, cotton, soybean, tomato, alfalfa, Az:^idopsis, 
corn, barley, carrot and guayule plants) , and EP 486290 
(enhancement of squalene and specific sterol [squalene 
zymosterol , cholest-7 , 24 -dienol , cholest-5 , 7 , 24-trienol] 
10 accumulation in yeast by increasing HMGR activity in yeast 
deficient in ensymes that convert squalene to ergosterol) , 

In those patents, the amount of a protein exhibiting 3 
hydroxy- 3 -methylglutaryl Coenzyme-A reductase (HMGR) 
activity is typically increased. HMGR widens a "bottleneck 
15 near the beginning of a biosynthetic path to steroid 

production, permitting a higher carbon flux through steroid 
biosynthetic pathways and resulting in increased sterol 
accumulation . 

U.S. Patent No. 5,480,805 "Composition for modulating 
20 sterols in yeast" (enhancement of delta 8-7 isomerase 

activity-ERG2 enhances accumulation of specific sterols in 
yeast) . 

U.S. 5,460,949 "Method and composition for increasing 
the accumulation of squalene and specific sterols in yeast" 

25 (increasing squalene, zymosterol and specific sterols in 

yeast by increasing HMGR in yeast having decreased' erg5 and 
ergS activity- -Sc and hamster HMGR) . 

WO 9845457 (SMTI, ErgS from A. t., corn, yeast; 
transgenic plants with altered sterol level sousing DNA 

30 encoding an enzyme binding a first sterol and producing a 
second sterol- -altered carotenoid, tocopherol, modified FA 
levels- -HMGR, 5a-reductase, geranylgeranyl pyrophosphate 
synthase, phytoene synthase, phytoene desaturase, 
isopentenyl diphosphate isomerase) . 
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Acetate is the metabolic precursor of a vast array of 
.compounds vital for cell and organism viability. Acetyl 
coenzyme A (CoA) reacts with acetoacetyl CoA to form 3- 
hydroxy- 3 methylglutaryl CoA (HMG-CoA) . HMG-CoA is reduced 
5 to mevalonate in an irreversible reaction catalyzed by the 
enzyme HMG-CoA reductase. Mevalonate is pho^phorylated and 
decarboxylated to isopentenyl -pyrophosphate (IPP) . Through 
the sequential steps of isomerization, condensation and 
dehydrogenation, IPP is converted to geranyl pyrophosphate 

10 (GPP). GPP combines with IPP to form farnesyl pyrophosphate 
(FPP) , two molecules of which are reductively condensed to 
form squalene, a 3 0 -carbon precursor of sterols. 

A key enzyme in sterol biosynthesis is 3 -hydroxy- 3- 
methylglutaryl -Coenzyme A reductase (HMG-CoA reductase or 

15 HMGR) . Schaller et al (Plant Physiol. 109: 761-770, r995) 

found that over-expression of rubber HMGR {hmgl) genomic DNA 
in tobacco leads to the overproduction of sterol end- 
products (sitosterol, campesterol and stigmasterol) up to 6- 
fold in leaves. Further, the excess sterol was stored as 

20 steryl-esters in lipid bodies. HMGR activity was increased 
by 4- to 8- fold. 

Sterols are derivatives of a fused, reduced ring 
system, cyclopenta- [a] ^phenanthrene, comprising three fused 
cyclohexane rings (A, B, and C) in a phenanthrene 

25 arrangement, and a terminal cyclopentane ring (D) having the 
formula (I) and carbon atom position numbering shown below: 
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where R is an 8 to 10 carbon- atom side chain. 

In plants, sgualene is converted to squalene epoxide, 
which is then cyclized to form cycloartenol (4,4,14a- 
5 trimethyl-gp, 19-cyclo-5a-cholest-24-en-3p-ol) . 

Cycloartenol has two methyl groups at position 4, a methyl 
group at position 14, a methylene bridge between the carbon 
atoms at positions 9 and 19 that forms a disubstituted 
cyclopropyl group at those positions, and includes an 8- 
10 carbon sidechain of the formula: CH3CH {CH2) 2CH=C (CH3) 2 • 
Squalene epoxide can alternatively be converted into 
pentacyclic sterols, containing five instead of four rings. 
Exemplary pentacyclic sterols include the phytoalexins and 
. saponins . 

15 Being one of the first sterols in the higher plant 

biosynthetic pathway, cycloartenol serves as a precursor for 
the production of numerous other sterols. In normal plants, 
cycloartenol is converted to predominantly 24 -methylene 
cycloartenol (4,4, 14a-dimethyl-9p, 19-cyclo-22 , 23-dihydro- 

20 ergosta-24 (28) -en-3-p-ol) , cycloeucalenol , (4 , 14a-trimethyl- 
9P,19 cyclo-5a-ergosta-24 (28) -en-3p-ol) , isof ucosterol (5a- 
stigmasta-5-24 (28) -dien-3p-ol) , sitosterol (5a-stigmasta-5- 
en-3p-ol) , stigmasterol- (stigmasta-5 , -22-dien-3P-ol) , 
campesterol (5a-ergosta-5-en-3p-ol) , and cholesterol (5a- 

25 cholesta-5-en-3p-ol) . These transformations are illustrated 
in Fig. 1. 

Although sterols produced by plants, and particularly 
higher (vascular) plants, can be grouped by the presence or 
absence of one or more of several functionalities, plant 
30 sterols are classified into two general groups herein; i.e., 
those containing a double bond between the carbon atoms at 
positions 5 and 6 (delta-5 or A5 sterols) and those not 
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containing a double bond between the carbon atoms at 
positions 5 and 6 (non-delta-5 sterols) . 

Exemplary naturally-occurring delta-5 plant sterols are 
isof ucosterol , sitosterol , stigmasterol , campesterol , 

5 cholesterol, and dihydrobrassicasterol . Exemplary naturally 
occurring non-delta~5 plant sterols are cycloartenol , 24- 
methylene cycloartenol, cycloeucalenol , and obtusif oliol . 
The most abundant sterols of vascular . plants are 
campesterol, sitosterol, and stigmasterol, all of which . 

0 contain a double bond between the carbon atoms at positions 
5 and 6 are classified as delta-5 sterols. 

The HMG-CoA reductase enzymes of animals and yeasts are 
integral membrane glycoproteins of the endoplasmic 
reticulum. The intact enzyme comprises three regions: a 

5 catalytic region containing the active site of the enzyme; a 
membrane binding region anchoring the enzyme to the 
endoplasmic reticulum; and a linker region joining the 
catalytic and membrane binding regions of the enzymes. The 
membrane binding region occupies the amino- terminal (N- 

D terminal) portion of the intact protein, whereas the 

catalytic region occupies the carboxy- terminal (C-terminal) 
portion of the protein, with the linker region constituting 
the remaining portion. M.E. Basson et al., Mol, Cell Biol., 
8 (9) :3797-3808 (1988). 

5 The activity of HMG-CoA reductase in animals and yeasts 

is known to be subject to feedback inhibition by sterols. 
Such feedback inhibition requires the presence of the 
membrane binding region of the enzyme. See, e.g., G. Gil et 
al. Cell, 41:249-258 (1985); M. Bard and J.F. Downing, J". 

} Gen. Microbiol., 124:415-420 (1981). 

Given that mevalonate is the precursor for sterols and 
other isoprenoids, it might be expected that increases in 
the amount or activity of HMG-CoA reductase would lead to 
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increases in the accumulation of both sterols and other 
isoprenoids . 

In mutant strains of the yeast Saccha^romyces cerevisiae 
(S. cerevisiae) having abnormally high levels of HMG-CoA 
5 reductase activity, the production of two sterols, 4,14- 
dimethyl zymosterol and 14-methylfucosterol is markedly 
increased above normal. Downing, et al . , Biochem. Biophys. 
Res. Comm., 94(3): 874-979 (1980). 

When HMG-CoA reductase activity was increased by 
10 illumination in non- photo synthetic microorganisms, 

isoprenoid (carotenoid) , but not sterol (ergosterol) , 
synthesis was enhanced. Tada, et al . , Plant and Cell 
Physiology, 23 (4) : 615-621 (1982) '. 

WO 9703202 discloses a method for identifying agents 
15 modulating sterol biosynthesis using a yeast acetoacetyl CoA 
thiolase (ERGIO) gene linked to a reporter system to 
evaluate compounds, such as lovastatin and other HMG-CoA 
synthase inhibitors, that affect cholesterol biosynthesis. 
U.S. Patent No. 5,668,001 teaches a recombinant avian 
20 HMG-CoA synthase preparation useful for evaluating drugs 
that inhibit cholesterol biosythesis. 

JP 09121863 discloses a plant with increased 3-hydroxy- 
3 -methyl glutaryl coenzyme A reductase (HMGR) activity as a 
result of increasing the expression of a mutant protein 
25 kinase gene that regulates expression of the HMGR gene. The 
increased HMGR activity increases squalene, zymosterol, 
cholesta-7 , 24-dienol and cholest-5 , 7, 24 -trenol accumulation 
in yeast with ERGS and ERG6 mutants. 

EP 48 073 0 "Plant -sterol accumulation and pest 
30 resistance -by increasing copy number of 3 -hydroxy- 3 -methyl 
glutaryl coenzyme -A reductase gene in tobacco, tomato and 
corn 

WO 9913086 "Human Delta 7 -sterol reductase polypeptide- 
useful for diagnosis or treatment of genetic defects e.g. 

05>Ofi107PAP I > 
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hereditary Smith-Lemli-Opitz syndrome" teaches making and 
using the recombinant polypeptide with humans. 

Chappell et al . U.S. Patent No. 5,589,619 teaches that 
transformation of higher plants with truncated HMG-CoA 
5 reductase enhanced the production of squalene, cycloartenol 
and certain sterols, particularly compounds having 
unsaturations at the 5-position. Several intermediate 
sterols as are shown in Fig. 1 were also produced. It would 
be beneficial if the production of sitosterol and 

10 stigmasterol could be enhanced while lessening the 

accumulation of the intermediate sterols. The present 
invention provides avenues for enhancing production of 
sitosterol and stigmasterol and lessening the accumulation 
of the intermediate sterols. 

15 Gonzalez et al . (Abstract of poster at Third Terpnet 

Meeting of the European Network on Plant Isoprenoids, May 
29-30, 1997, Poitiers, France) over-expressed the 
ArahicLopsis HMGR cDNA {hmgl and hmg2) and found sterol • 
overproduction with hmgl only. They used two forms of -the 

20 hmgl gene, a full-length form and a truncated form 

containing only the catalytic domain. HMGRs have three 
domains, an N-terminal membrane spanning domain, a short 
linker domain, and a C- terminal catalytic domain. In this 
case the transgenic plants were also AraJbidopsis. The 

25 difference between the full-length and truncated forms was a 
greater accumulation of pathway intermediates in the case of 
the truncated form. More importantly, the intermediates 
demonstrated as accumulating were cycloartenol, 24- 
methylenecycloartanol and obtusif oliol . 

30 Finally, US patents 5,365,017 and 5,306,862, both 

assigned to Amoco Corp., disclose a method for increasing 
sterol accumulation in plants by increasing the copy number 
of a gene having HMG-CoA reductase activity. These 
inventions disclose a method using hamster truncated HMGR 
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that consisted of the catalytic domain and the linker 
domain. According to the claims the linker domain was 
essential for activity. They also demonstrated a greater 
accumulation of pathway intermediates such as cycloartenol . 

5 

BRIEF SUMMARY 

The present invention relates to transgenic plants and 
their progeny having improved nutritional characteristics. 
More particularly, the present invention relates to 

10 transgenic plants and their progeny, the storage organs 
(e.g. seed, fruit and vegetable parts) of which contain 
modified levels of steroid compounds, such as (i) elevated 
levels of beneficial phytosterols (e.g., sitosterol), 
phytostanols (e.g., sitostanol) , and esters thereof, 

15 relative to an otherwise identical plant transformed only 
with a truncated HMG-CoA reductase gene or a wild-type 
plant, and (ii) reduced levels of steroid pathway 
intermediate compounds (e.g. one or more of squalene, 
cycloartenol, 2 4 -methylene cycloartenol, obtusif oliol , 

20 stigmasta-7-enol and campesterol) in their storage organs 
relative to an otherwise identical transgenic plant 
transformed only with a truncated HMG-CoA reductase gene. 
Nucleic acid sequences encoding enzymes that affect the 
biosynthesis and accumulation of steroid compounds in plants 

25 (HMG-CoA reductase and a steroid pathway enzyme) , and 

methods for using these sequences to produce such transgenic 
plants, are also provided. These methods comprise, for 
example, introducing into cells nucleic acid sequences 
encoding enzymes that affect the levels of accumulated 

30 steroid pathway end products. 

The present invention contemplates a recombinant 
construct or a recombinant vector that contains 2 DNA 
sequences. The first encodes a polypeptide exhibiting 3- 
hydroxy- 3 -methylglutaryl -Coenzyme A (HMG-CoA) reductase 

BNSOCXID: <WO 02061 072A2 I > 
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activity. The second DNA sequence encodes a polypeptide 
exhibiting the activity of another steroid pathway enzyme. 
Each polypeptide-encoding DNA seqpience is operably linked in 
the 5 ' to 3 * direction to a promoter and a transcription 
5 termination signal sequence independent of the other 
sequence- The promoter is located upstream and the 
termination sequence downstream of each polypeptide -encoding 
DNA sequence. The second DNA sequence encoding a steroid 
pathway enzyme can code for a squalene epoxidase enzyme, a 

10 sterol methyl transferase I enzyme, a sterol C4-demethylase 
enzyme, a obtusifoliol C14a-demethylase enzyme, a sterol C5- 
desaturase enzyme, or a sterol methyl transferase II enzyme. 
It is contemplated that HMG-CoA reductase and the steroid 
pathway enzyme activity comes from a mutant or truncated 

15 form of those enzymes, such as a truncated HMG-CoA reductase 
lacking the transmembrane region while retaining a 
functional catalytic domain. Examples of such preferred HMG 
CoA reductases include the truncated rubber and Arabidopsls 
HMG CoA reductases disclosed herein. 

20 Preferably, the regulatory function of a promoter is 

substantially unaffected by cellular levels of squalene such 
as the CaMV 35S promoter. In one aspect, a promoter is 
seed-specific. In another aspect, a promoter is derived 
from a species in a different order from a host cell - In 

25 another aspect, the HMG-CoA reductase or steroid pathway 
enzymes is from a species in a different order from the 
order that of the host cell . The invention contemplates a 
construct or recombinant vector having more than one DNA 
sequence encoding a steroid pathway enzyme that do not have 

30 to be under the control of the. same promoter. Preferably, a 
recombinant vector is a plant expression vector. 

In another aspect of the invention, a transformed host 
cell comprises a recombinant construct or vector as 
described above. Preferably, a host cell is plant cell, 

020ei072A2.l_> 
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preferably that plant cell is from canola, soybean, corn, 
maize, tobacco, cotton, rape, tomato or alfalfa. The 
invention contemplates a host cell in a cell culture, plants 
derived from transformed host cells, and storage organs 
5 (seeds, fruits and vegetable parts) from transgenic plants. 
In addition to contemplating transgenic plants and 
seeds, the invention contemplates transgenic plant seeds 
capable of germinating into a transgenic plant and mutants, 
recombinants, genetically engineered derivatives thereof and 
10 hybrids derived therefrom. The plant over- accumulates 
steroid pathway products relative to a native, non- 
transgenic plant of the same strain, wherein said mutants, 
recombinants, genetically engineered derivatives thereof and 
hybrids derived therefrom maintain the ability to 
15 overaccumulate steroid pathway products. 

The invention contemplates a process of increasing the 
formation of steroid pathway products in a transformed host 
cell as compared to an otherwise identical non- trans formed 
host cell. Contemplated processes use the described 
recombinant constructs and vectors to transform host cells, 
then growing the host cells or regenerating transgenic 
plants therefrom. 

In one aspect of the invention, the genome of a 
contemplated plant, its progeny, seeds or cell culture, 
comprises introduced DNA encoding an HMG-CoA reductase 
activity and introduced DNA encoding a steroid pathway 
enzyme that is a squalene epoxidase enzyme, a sterol methyl 
transferase I enzyme, a sterol C4-demethylase enzyme, a 
obtusifoliol C14a-demethylase enzyme, a sterol C5-desaturase 
enzyme, or a sterol methyl transferase II enzyme. The 
storage organs of such a plant contain an elevated level of 
total accumulated sterol, compared to storage organs of an 
otherwise identical plant, the genome of which does not 
comprise sai4 introduced DNA. Further, the storage organs 
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of the plant contain a reduced level of squalene, 
cycloartenol, 24 -methylene cycloartenol , obtusif oliol , 
stigmasta-7-enol, or campesterol compared to the seeds of an 
otherwise identical plant or a plant comprising an 
5 introduced DNA encoding an HMG-CoA reductase enzyme. 

The invention contemplates a method of producing a 
plant that accumulates an elevated level of sterol pathway 
products compared to a corresponding plant comprising no 
introduced DNA encoding a peptide, polypeptide, or protein 

10 that affects the biosynthesis and accumulation of a sterol 
pathway product, comprising sexually crossing plants to 
arrive at a plant comprising nucleic acid encoding an HMG 
CoA reductase and a steroid pathway enzyme, including 
crosses with a nurse cultivar. The plants, including 

15 apomicitic plants, uniform populations of the plants and 
their seeds and parts other than seeds are contemplated. 

Another aspect of the invention is oils containing at 
least one sterol pathway product, extracted from the seeds 
of a contemplated plant. Preferably sitosterol, at least 

20 one sitosterol ester, or mixtures thereof, comprise at least 
about 57% by weight of the total sterol compounds of a 
contemplated oil. Preferably sitosterol, that at least one 
sitosterol ester, or mixtures thereof, comprise at least 
about 0.08% of the dry weight of a contemplated seed. 

25 Preferably, the oil has a reduced amount of squalene, 
cycloartenol , 24 -methylene cycloartenol , obtusif oliol , 
stigmasta-7-enol , campesterol, or combinations thereof, 
compared to oil from a corresponding transgenic plant that 
does not contain introduced DNA encoding a squalene 

30 epoxidase enzyme, a sterol methyl transferase I enzyme, a 
sterol C4-demethylase enzyme, a obtusif oliol C14a- 
demethylase enzyme, a sterol C5-desaturase enzyme, a sterol 
methyl transferase II enzyme, or mixture thereof; wherein 

_02061072A2^l_> 
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the reduction is in the range of from about 10% to about 
100%. 

Sitosterol ester compositions derived from transgenic 
plants of the present invention, their progeny or their 
5 seeds are also contemplated, preferably wherein an 

esterifying fatty acid has 2 to 22 carbon atoms in the main 
chain. 

A further aspect of the invention is cholesterol- 
lowering compositions comprising contemplated oils and 

10 sitosterol ester compositions. Another further aspect of 
the invention is foods, food ingredients, or food 
compositions comprising contemplated oils. 

Still further, the invention contemplates 
pharmaceutical compositions comprising a cholesterol - 

15 lowering effective amount of a contemplated oil, and a 

pharmaceutically acceptable carrier, excipient, or diluent. 

A method of lowering the plasma concentration of low 
density lipoprotein cholesterol is contemplated, comprising 
orally administering to a human or animal subject an 

20 effective amount of an above composition. Also contemplated 
is a method of treating or preventing an elevated plasma 
concentration of low-density lipoprotein cholesterol, 
comprising orally administering to a human or animal subject 
an effective amount of a contemplated composition. 

25 A related aspect of the invention is a method of making 

a food additive composition, comprising obtaining oil 
containing a sterol pathway product compound from seed of a 
contemplated transgenic plant and mixing the oil with an 
edible solubilizing agent, an effective amount of a 

30 dispersant, and optionally, an effective amount of an 
antioxidant . 

Novel forms of two sterol pathway enzymes and the 
nucleic acids that encode them are disclosed: an Arahidopsis 
enzyme having nucleic acid similarity to a squalene 
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epoxidase, and an Arabidopsis enzyme having nucleic acid 
similarity to an obtusifoliol C14a-demethylase enzyme. 
Thus, the invention contemplates an isolated DNA molecule 
having a nucleotide sequence of disclosure SEQ ID NO: 4, 6, 

5 8, 10, 14, 15, 11 OT the complements thereof. Also 

contemplated is a nucleotide sequence that hybridizes to the 
nucleotide sequence of SEQ ID NO: 4, 6, 8, 10, 14, 15, 11 ox 
their complements under a wash stringency equivalent to 0 . 5X 
SSC to 2X SSC, 0.1% SDS, at 55-65<=>C, and that encode a 

10 polypeptide having squalene epoxidase or obtusifoliol C14a- 
demthylase enzymatic activity. Preferably, that enzymatic 
activity is substantially similar to that of a disclosed 
squalene epoxidase or obtusifoliol C14a-demethylase, 
respectively. By substantially smiliar is meant having 

15 enzymatic activity differing from that of the disclosed 
enzymes by about 30% or less, preferably by about. 20% or 
less, and more preferably by about 10% or less when 
assayed by standard enzymatic assays. Also contemplated 
is a nucleotide sequence encoding the same genetic 

20 information as said nucleotide sequence of SEQ ID NO: 4, 6, 
8, 10, 14, 15, 17 or their complements or that hybridize as 
described above, but which is degenerate in accordance with 
the degeneracy of the genetic code. Recombinant constructs, 
vectors and transformed host cells comprising the novel 

25 isolated and purified nucleic acid sequences are also 
contemplated. In one embodiment, the vector is a plant 
vector and the host cell is a plant cell. Methods of 
producing the disclosed squalene epoxidase or obtusifoliol 
C14a-demethylase enzymes are also contemplated coirprising 

30 culturing a transformed host cell for a time and under 
conditions conductive to the production of the squalene 
epoxidase or obtusifoliol C14a-demethylase enzyme, and 

02061072A2 I > 
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recovering the produced squalene epoxidase or obtusifoliol 
C14a-deTnethylase enzyme . 

Yet another aspect provides any of the above described 
transformed host cells, further comprising a recombinant 
5 construct or expression vector encoding a tocopheral 
synthesis pathway enzyme, and in particular, S- 
adenosylmethionine- dependent a-tocopherol methyltransf erase . 
Also included are plants, seeds and storage organs 
comprising the transformed host cells. 

10 Another aspect provides, a process of increasing the 

formation of steroid pathway products and tocopherols in a 
transformed host cell as compared to an otherwise identical 
non- transformed host cell comprising (1) transforming a host 
cell with a recombinant vector comprising (a) as operably 

15 linked components in the 5' to 3 * direction, a promoter, a 
DNA sequence encoding a first polypeptide having 3-hydroxy- 
3 -me thylglutaryl -Coenzyme A reductase enzyme activity, and a 
transcription termination signal sequence; and (b) as 
operably linked components in the 5 ' to 3 ' direction, a 

20 promoter, a DNA sequence encoding at least one polypeptide 
having steroid pathway enzyme activity selected from the 
group consisting of squalene epoxidase enzyme activity, 
sterol methyl transferase I enzyme activity, sterol C4- 
demethylase enzyme activity, obtusifoliol C14a-demethylase 

25 enzyme activity, sterol C5-desaturase enzyme activity, and 
sterol methyl transferase II enzyme activity, and a 
transcription termination signal sequence; (2) transforming 
the host cell of (1) with a recombinant vector comprising as 
operably linked components, a promoter, a DNA sequence 

30 encoding a tocopherol synthesis pathway enzyme, and a 

transcription termination sequence; and (3) regenerating 
said transformed plant cell into said transgenic plant . 
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BRIEF DESCRIPTION OF THE DRAWINGS 

These and other features, aspects, and advantages of 
the present invention will become better understood with 
. 5 regard to the following description, appended claims and 
accompanying figures where : 

1 Fig. 1 is an abbreviated version of a plant steroid 

\ 

\ compound biosynthetic pathway that shows the enzymes 

affecting steroid compound biosynthesis and accumulation. 

10 These include: HMG-CoA reductase, squalene epoxidase, sterol 
methyl transferase I, sterol C4-demethylase, obtusifpliol 
C14a-demethylase, sterol C5 desaturase and sterol methyl 
transferase II. 

Fig. 2 depicts the forms of Arabldopsls and rubber^ 

15 HMGRl tested in Arahidopsis and yeast to compare expression, 
activity and sterol production. 

Fig. 3 is a map showing the structure of construct.. 
pMON29920. pMON29920 is a binary transformation vector with 
P-7S/E9 3' cassette and the KSN gene flanked by the two. 

20 borders where P-7S is the promoter of alpha' beta 

conglycinin protein from soybean, E9 3 ' is the 3' end of pea 
rbc E9 gene and KAN is the coding sequence f or NPTII that 
confers resistance to kanamycin. The NPTII gene is driven 
by the 3 5S promoter from cauliflower mosaic virus. Spc.Str 

25 is the coding region for Tn7 adenylyltransf erase conferring 
resistance to spectinomycin and streptomycin; ori-V: the 
vegetative origin of replication; rop: coding region for 
repressor of primer; ori-322 : minimum known sequence 
required for a functional origin of replication; NOS 3': the 

30 3' termination end of nopaline synthase coding region. 

Fig. 4 is a map showing the structure of construct 
PMON43800. PMON43800 is a recombinant binary vector for 
Agrobacterium-mediated transformation, carrying the r\abber 
HMGRl gene cassette. The HMGRl gene is driven by the 7S 
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alpha' beta conglycinin promoter from soybean. P-7S: 7S 
promoter, rubber HMGRl gene: coding sequence for 3-hydroxy- 
3 -methyl glutaryl reductase from Hevea brasiliensis ; E9 3': 
3' end of pea rbcS E9 gene; P-35S: 35S promoter from 
5 cauliflower mosaic virus; KAN: coding region for NPTII gene 
confer-ring resistance for kanamycin; NOS 3': 3' termination 
end of nopaline synthase coding region: Left Border: 
Octapine left border from Octapine Ti plasmid pTiA6; ori-V: 
the vegetative origin of replication; rop: coding region for 

10 repressor of primer; Spc/Str: coding region for Tn7 
adenylyl transferase conferring resistance to 
spect inomycinand streptomycin . 

Fig, 5 is a map showing the structure of construct 
pM0N23616. pMON23G16 is a plant expression vector 

15 containing P-NOS/ORF-7/KAN/NOS-3 ' cassette. P-NOS : NOS 
promoter from Agrobacterium tumefaciens pTiT37; ORF-7: a 
short open reading frame that attenuates expression of KAN 
in plants; KAN: coding sequence of NPTII gene that confers 
resistance to kanamycin and neomycin; ble: confers 

20 resistance to bleomycin; NOS 3' : 3' termination end of 

nopaline synthase coding region; Left Border: Octapine left 
border from Octapine Ti plasmid pTiA6; ori-V: the vegetative 
origin of replication; rop: coding region for repressor of 
primer; Spc/Str: coding region for Tn7 adenylyltransf erase 

25 conferring resistance to spect inomycin and streptomycin. 

Fig. 6 is a map showing the structure of construct 
pMON43818. pMON43818 is a recombinant binary vector 
carrying the gene encoding rubber hydroxymethyl glutaryl CoA 
reductasel (HMGRl) in sense orientation driven by the 

30 soybean alpha' beta conglycinin promoter. P-NOS: nopaline 
synthase gene promoter; kan: coding region for neomycin 
phospho transferase protein to confer resistance to 
kanamycin; NOS 3': 3' termination end of nopaline synthase 
coding region; Soy Alpha' Beta Conglycinin: 7S alpha' beta 
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conglycinin gene promoter froTn soybean; Rubber HMGRl gene: 
coding sequence for HMGRl gene from Hevea hrasillensis; E9 
3': 3' end of pea rbcS E9 gene; Left border: octopine left 
border, sequence essential for transfer of T-DNA into 
5 Agrohacterixmi} ori-V: plasmid origin of replication in 

AgrjroJbacte2riu277; . rop: coding sequence for repressor of primer; 
Ori-322: origin of replication in ^.coli; Spc/Str: coding 
region for Tn7 adenylyltransf erase (AAD(3")) conferring 
resistance to spectinomycin and streptomycin; Right Border: 

10 right border sequence of T-DNA essential for integration 
into AgrohactGirium. 

Figure 7 is a map showing the structure of construct 
pMON43052. pMON43052 is a recombinant shuttle vector, 
carrying the cDNA fragment encoding the catalytic domain of 

15 Arabidopsis HMGRl in sense orientation driven by the soybean 
alpha' beta conglycinin promoter. P-NOS: nopaline synthase 
gene promoter; kan: coding region for neomycin 
phosphotransferase protein to confer resistance to 
kanamycin; NOS 3': 3' termination end of nopaline synthase 

20 coding region; Soy Alpha' Beta Conglycinin: 7S alpha' beta 
conglycinin gene promoter from soybean; Arabidopsis HMGR 
catalytic domain: coding sequence for the catalytic domain 
of Arabidopsis HMGRl protein; E9 3' : 3' end of pea rbcS E9 
gene; Left border: octopine left border, sequence essential 

25 for transfer of T-DNA into Agrrobactejrium; ori-V: plasmid 

origin of replication in Agrobacterium; rop : coding sequence 
for repressor of primer; Ori-322: origin of replication in 
E.coli; Spc/Str: coding region for Tn7 adenylyltransf erase 
(AAD(3")) conferring resistance to spectinomycin and 

30 streptomycin; Right Border: right border sequence of T-DNA 
essential for integration into AgroJbacteJrium. 

Figure 8 is a map showing the structure of construct 
pMON51850. pMON51850 is a binary vector for AgrroJbactejrium 
mediated transformation of soybean. P-NOS: nopaline 
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synthase gene promoter; kan: coding region for neomycin 
phosphotransferase protein to confer resistance to 
kanamycin; NOS 3': 3' termination end of nopaline synthase 
coding region; Left border: octopine left border sequence 
5 essential for transfer of T-DNA into Agrrojbacterium; ori-V: 
plasmid origin of replication in Agrrojbacterium; rop: coding 
sequence for repressor of primer; ori-322: origin of 
replication in E.coli; Spc/Str: coding region for Tn7 
adenylyl transferase (AAD(3")) conferring resistance to 

10 spectinomycin and streptomycin; Right Border: right border 
sequence of T-DNA essential for integration into 
AgrroJbac teri um . 

Figure 9 is a map showing the structure of construct 
pMON43057. pMON43057 is a recombinant binary vector for 

15 AgrohactGrium mediated transformation of soybean, carrying 
the gene cassette for expressing catalytic domain of HMGRl 
from Arabidopsis thaliajia.. The catalytic domain of the 
HMGRl cDNA is driven by soybean 7S alpha' beta conglycinin 
promoter. P-NOS : nopaline synthase gene promoter; kan: 

20 coding region for neomycin phosphotransferase protein to 
confer resistance to kanamycin; NOS 3': 3' termination end 
of nopaline synthase coding region; Left border: octopine 
left border sequence essential for transfer of T-DNA into 
AgrroJbacterium; ori-V: plasmid origin of replication in 

25 Agrohacterium; rop: coding sequence for repressor of primer; 
ori-322: origin of replication in E.coli} Spc/Str: coding 
region for Tn7 adenylyltransf erase (AAD(3")) conferring 
resistance to spectinomycin and streptomycin; Right Border: 
right border sequence essential for transfer of T-DNA into 

30 AgrohactGrium; Soy Alpha' Beta Conglycinin: soybean 7S 
alpha' beta conglycinin gene promoter; Arahldopsis HMGR 
catalytic domain: coding sequence for Arabidopsis HMGRl 
catalytic domain; E9 3' : 3' end of pea rbcS E9 gene. 
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Figure 10 is a map showing the structure of construct 
pMON43058. pMON43058 is a recombinant binary vector for 
AgrjroJbacteriujn-mediated soybean transformation, carrying gene 
expression cassettes for catalytic domain of HMGRl from 
5 Arahidopsis thaliana and SMTII from Arahldopsis thaliana. 
P-NOS: nopaline synthase gene promoter; kan:. coding region 
for neomycin phosphotransferase protein to confer resistance 
to kanamycin; NOS 3' : 3' termination end of nopaline 
synthase coding region; Left border: octopine left border 

10 sequence essential for transfer of T-DNA into Agrdbacteriunt; 
ori-V: plasmid origin of replication in Agrr-oJbacterium; rop: 
coding sequence for repressor of primer; ori-322: origin of 
replication in E.colij Spc/Str: coding region for Tn7 
adenylyltransf erase (AAD(3")) conferring resistance to- 

15 spectinomycin and streptomycin; Right Border: right border 
sequence essential for transfer of T-DNA into Agrobacterium; 
Soy Alpha' Beta Conglycinin: 7S alpha' beta conglycinin gene 
promoter from soybean; Arabidopsis HMGR catalytic domain: 
sequence encoding the catalytic domain of Ajrabi dopsis -HMGRl ; 

20 E9 3' : 3' end of pea rbcS E9 gene; Soy Alpha' Beta 
Conglycinin: soybean 7S alpha 'beta conglycinin gene 
promoter; Arabidopsis SMT2 : cDNA encoding sterol methyl 
transferase II enzyme from Arabidopsis thaliana (accession 
no: X89867) ; NOS 3' : 3' termination end of nopaline. synthase 

25 coding region. 

Fig. 11 is profile (histogram) of the sterol 
composition of Rl transgenic soybean seeds when Arabidopsis 
truncated HMGR (catalytic domain without linker) was 
overexpressed using seed- specif ic 7S promoter (data from 

30 pMON43 057 :p7S : :At HMGR truncated) . 

Fig. 12 is a profile (histogram) of the sterol 
composition of Rl transgenic soybean seeds when Arabidopsis 
truncated HMGR (catalytic domain without linker) and 
Arabidopsis SMTII were overexpresed (data from 
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pMON43 058:p7S: :At HMGR truncated and p7S::At SMTII) . The 
expression of the genes is controlled by the seed-specific 
IS promoter. 

Figure 13 is a map showing the structure of construct 
5 pMON53733. piyiON53733 is a recombinant binary vector 

carrying the cDNA encoding full-length form of Arahidopsis 
hydroxymethyl glutaryl CoA reductasel (HMGRl) in sense 
orientation driven by the enhanced cauliflower mosaic virus 
35S promoter. P-35S: 35S promoter from cauliflower mosaic 

10 virus; kan: confers resistance to neomycin and kanamycin; 
NOS 3' : 3' termination end of nopaline synthase coding 
region; Left border: oct opine left border, sequence 
essential for transfer of T-DNA into AgrroJbacterium; ori-V: 
plasmid origin of replication in Agrobacterium; rop: coding 

15 sequence for repressor of primer; ori-322: origin of 
replication in E.coli} Spc/Str: coding region for Tn7 
adenylyltransf erase (AAD(3")) conferring resistance to 
spectinomycin and streptomycin; Right Border: right border 
sequence of T-DNA essential for integration into 

20 • Agrobacterlum; P-e35S: enhanced cauliflower mosaic virus 
promoter; Arabidopsis HMGRl: cDNA sequence encoding full- 
length form of Arabidopsis HMGRl; E9 3' : 3' end of pea rbcS 
E9 gene. 

Figure 14 is a map showing the structure of construct 
25 pMON53734- pMON53734 is a recombinant binary vector 

carrying the cDNA encoding catalytic domain with linker 
region of Arabidopsis hydroxymethyl glutaryl CoA reductasel 
(HMGRl) in sense orientation driven by the enhanced 
cauliflower mosaic virus 35S promoter. P-35S: 35S promoter 
30 from cauliflower mosaic virus; kan: confers resistance to 
neomycin and kanamycin; NOS 3' : 3' termination end of 
nopaline synthase coding region; Left border: octopine left 
border, sequence essential for transfer of T-DNA into 
A^rrojbacteriujTi; ori-V: plasmid origin of replication in 
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Agroha.ctGriuin} rop : coding sequence for repressor of primer; 
ori-322: origin of replication in E.colij Spc/Str: coding 
region for Tn7 adenylyltransf erase (AMD (3")) conferring 
resistance to spectinomycin and streptomycin; Right Border: 
5 right border sequence of T-DNA essential for integration 
into Agrrojbactejrium; P-e35S: enhanced cauliflower mosaic 
virus promoter; Arabidopsis tHMGRl : cDNA sequence encoding 
catalytic domain with linker region of Arabidopsis HMGRl; E9 

-5-' • 

3' : 3' end of pea rbcS E9 gene. 

10 Figure 15 is a map showing the structure of construct 

pMON53735. pMON53735 is a recombinant binary vector 
carrying the cDNA encoding catalytic domain without the 
linker region of Arabidopsis hydroxymethyl glutaryl CoA 
reductasel (HMGRl) in sense orientation driven by the 

15 enhanced cauliflower mosaic virus 35S promoter. P-35S: 35S 
promoter from cauliflower mosaic virus; kan: confers 
resistance to neomycin and kanamycin; NOS 3': 3' termination 
end of nopaline synthase coding region; Left border: 
octopine left border, sequence essential for transfer bf T- 

20 DNA into AgrroJbactejrium; ori-V: plasmid origin of replication 
in Agrobacterium; rop: coding sequence for repressor of 
primer; ori-322: origin of replication in E.coli; Spc/Str: 
coding region for Tn7 adenylyltransf erase (AAD(3")) 
conferring resistance to spectinomycin and streptomycin; 

25 Right Border: right border sequence of T-DNA essential for 

integration into Agrobacterium; P-e35S: enhanced cauliflower 
mosaic virus promoter; Arabidopsis cHMGRl : cDNA sequence 
encoding catalytic domain without the linker region of 
Arabidopsis HMGRl; E9 3' : 3' end of pea rbcS E9 gene. 

30 Figure 16 is a map showing the structure of construct 

pMON53736. pMON53736 is a recombinant binary vector 
carrying the cDNA encoding full-length form of rubber (Hevea 
brasiliensis) hydroxymethyl glutaryl CoA reductasel (HMGRl) 
in sense orientation driven by the enhanced cauliflower 
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mosaic virus 3 5S promoter. P-3 5S: 35S promoter from 
cauliflower mosaic virus; kan: confers resistance to 
neomycin and kanamycin; NOS 3': 3' termination end of 
nopaline synthase coding region; Left border: octopine left 
5 border, sequence essential for transfer of T-DNA into 
AgrroJbacteriuzn; ori-V: plasmid origin of replication in 
Agrrojbacterluni; rop: coding sequence for repressor of primer; 
ori-322: origin of replication in E.coli; Spc/Str: coding 
region for Tn7 adenylyltransf erase (AAD(3")) conferring 

10 resistance to spectinomycin and streptomycin; Right Border: 
right border sequence of T-DNA essential for integration 
into AgrroJbacterium; P-e35S: enhanced cauliflower mosaic 
virus promoter; Hevea HMGRl : cDNA sec[uence encoding full- 
length form of rubber HMGRl; E9 3' : 3' end of pea rbcS E9 

15 gene . 

Figure 17 is a map showing the structures of construct 
pM0N53737. pMON53737 is a recombinant binary vector 
carrying the cDNA encoding catalytic domain with linker 
region of rubber (Hevea brasiliensls) hydroxymethyl glutaryl 

20 CoA reductasel (HMGRl) in sense orientation_driven by the 

enhanced cauliflower mosaic virus 35S promoter. P-35S: 35S 
promoter from cauliflower mosaic virus; kan: confers 
resistance to neomycin and kanamycin; NOS 3' : 3' termination 
end of nopaline synthase coding region; Left border: 

25 octopine left border, sequence essential for transfer of T- 
DNA into Agrobacteriuw} ori-V: plasmid origin of replication 
in Agrobact:e:rium; rop: coding sequence for repressor of 
primer; ori-322: origin of replication in E.coli; Spc/Str: 
coding region for Tn7 adenylyltransf erase (AAD(3")) 

30 conferring resistance to spectinomycin and streptomycin; 

Right Border: right border sequence of T-DNA essential for 
integration into Agrohacterixxm; P-e35S: enhanced cauliflower 
mosaic virus promoter; rxobber tHMGRl : cDNA secjuence encoding 
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catalytic domain with linker region of rubber HMGRl; E9 3 ' : 
3 ' end of pea rbcS E9 gene . 

Figure 18 is a map showing the structure of construct 
pMON53738. pMON53738 is a recombinant binary vector 
5 car2rying the cDNA encoding mutant form of rubber (Hevea 

hrasiliensis) hydroxymethyl glutaryl CoA reductasel (HMGRl) 
in sense orientation driven by the enhanced cauliflower 
mosaic virus 35S promoter. In the mutant rubber HMGRl the 
putative phosphorylation site, the serine amino acid residue 

10 at position 566 is changed to alanine amino acid residue 
(SEQ ID 23) . P-3 5S: 3 5S promoter from cauliflower mosaic 
virus; kan : confers resistance to neomycin and kanamycin; 
NOS 3': 3' termination end of nopaline synthase coding 
region; Left border: octopine left border, sequence 

15 essential for transfer of T-DNA, into Agrobacterium; ori-V: 
plasmid origin of replication in Agrohacizeriunt} rop: coding 
sequence for repressor of primer; ori-322: origin of 
replication in E.colij Spc/Str: coding region for Tn7 
adenylyltransf erase {AAD(3")) conferring resistance to- - 

20 spectinomycin and streptomycin; Right Border: right border 
sequence of T-DNA essential for integration into 
AgrohsLCterium} P-e35S: enhanced cauliflower mosaic virus 
promoter; rubber tHMGRl Ala 566: cDNA sequence encoding 
catalytic domain with linker region of rubber HMGRl in which 

25 serine amino acid residue at position 566 is changed to 

alanine amino acid residue using site directed mutagenesis; 
E9 3': 3' end of pea rbcS E9 gene. 

Figure 19 is a map showing the structure of construct 
pMON53739. pMON53739 is a recombinant binary vector 

30 carrying the cDNA encoding mutant form of rubber (Hevea 

brasiliensis) hydroxymethyl glutaryl CoA reductasel (HMGRl) 
in sense orientation driven by the enhanced cauliflower 
mosaic virus 35S promoter. In the mutant rubber HMGRl the 
putative phosphorylation site, the serine amino acid residue 
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at position 567 is changed to alanine amino acid residue 
(SEQ ID 24) . P-35S: 35S promoter from cauliflower mosaic 
virus; kan: confers resistance to neomycin and kanamycin; 
NOS 3': 3' termination end of nopal ine synthase coding 
5 region; Left border: octopine left border, sequence 

essential for transfer of T-DNA into Agrrojbacteriuni; ori-V: 
plasmid origin of replication in AgrroJbacteriuin; rop: coding 
sequence for repressor of primer; ori-322: origin of 
replication in E.coli; Spc/Str: coding region for Tn7 
10 adenylyl transferase (AAD(3")) conferring resistance to 

spectinomycin and streptomycin; Right Border: right border 
sequence of T-DNA essential for integration into 
Agrrojbacteriuin; P-e35S: enhanced cauliflower mosaic virus 
promoter; rubber tHMGRl Ala 567: cDNA sequence encoding 
15 catalytic domain with linker region of rubber HMGRl in which 
serine amino acid residue at position 567 is changed to 
alanine amino acid residue using site directed mutagenesis; 
E9 3' : 3' end of pea rbcS E9 gene. 

Figure 20 is a map showing the structure of construct 
20 PMON53740. pMON53740 is a recombinant binary vector 

carrying the cDNA encoding catalytic domain without linker 
region of rubber (Hevea hrasili^nsis) hydroxymethyl glutaryl 
CoA reductasel (HMGRl) in sense orientation driven by the 
enhanced cauliflower mosaic virus 35S promoter. P-35S: 35S 
25 promoter from cauliflower mosaic virus; kan: confers 

resistance to neomycin and kanamycin; NOS 3' : 3' termination 
end of nopal ine synthase coding region; Left border: 
octopine left border, sequence essential for transfer of T- 
DNA into Agrohacterium; ori-V: plasmid origin of replication 
30 in AgrroJbacterium; rop: coding sequence for repressor of 

primer; ori-322: origin of replication in E.coli; Spc/Str: 
coding region for Tn7 adenylyl transferase (AAD(3")) 
conferring resistance to spectinomycin and streptomycin; 
Right Border: right border sequence of T-DNA essential for 
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integration into AgrobacteTrium; P-e35S; enhanced cauliflower 
mosaic virus promoter; rubber cHMGRl : cDNA sequence encoding 
catalytic domain without linker region of rubber . HMGRl ; E9 
3' : 3' end of pea rbcS E9 gene. 
5 Fig. 21 is a graph comparing the cycloartenol content 

in micrograms of steroid compound per gram of seeds analyzed 
in transgenic Arahidopsis plants transformed with pMON53733 
through pMON5374d compared to control plants. 

Fig. 22 is a graph comparing the 24 -methylene 
10 cycloartenol content in micrograms of steroid compound per 
gram of seeds analyzed in transgenic Arahidopsis plants 
transformed with pMON53 733 through pMON53 74 0 compared to 
control plants. 

Fig. 2 3 is a graph comparing the obtusifoliol content 
15 in micrograms of steroid compound per gram of seeds analyzed 
in transgenic Arahidopsis plants transformed with pMON53733 
through pMON53740 compared to control plants. 

Fig. 24 is a graph comparing the campesterol content in 
micrograms of steroid compound per gram of seeds analyzed in 
20 transgenic Arahidopsis plants transformed with pMON53733 
through pMON53 74 0 compared to control plants. 

Fig. 2 5 is a graph comparing the sitosterol content in 
micrograms of steroid compound per gram of seeds analyzed in 
transgenic Arahidopsis plants transformed with pMON53733 
25 through pMON53740 compared to control plants. 

Fig. 26 is a graph comparing the sitostanol content in 
micrograms of steroid compound per gram of seeds analyzed in 
transgenic Arahidopsis plants transformed with pMON53733 
through pMON53740 compared to control plants. 
30 Fig. 27 is a sterol profile (histogram) of transgenic 

Arahidopsis harboring different forms of rubber HMGR. 

Fig. 2 8 is a graph of the squalene, zymosterol and 
. erogosterol content in micrograms of sterol per milligram of 
cell dry weight from HMGR constructs in yeast HMGRl knockout 
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. mutants for constructs having full length and truncated HMG 
CoA reductase (HMGR) sequences. The truncated sequences 
contain substantial portions of the catalytic region but 
lack the linker region and the transmembrane region of HMGR. 
5 These sequences are derived from Arabidapsls and rubber 
plants . 

Fig. 2 9 is a map showing the structure of construct 
pMON43842. pMON43842 is a yeast expression vector carrying 
cDNA encoding Arabidopsls putative obtusifoliol C14a~ 

10 demethylase (AC002329) in sense orientation driven by the 

p423Gall promoter. Sc.His3: HIS3 region from Saccharomyces 
cerevisi^G encoding imidazoleglycerol -phosphate dehydratase 
for histidine synthesis; Ori-fl:' bacteriophage fl origin of 
replication; LAC: contains partial lad coding sequence, 

15 promoter Plac, promoter Pt7, promoter Pt3, KS polylinker, 
and partial lacZ coding sequence; lacZ: partial coding 
sequence for beta-d-galactosidase or lacZ protein; T- 
Sc.Cycl: a terminator from Cycl- iso-1 -cytochrome c from 
Saccharomyces cerevisiae to terminate transcription; obtus . 

20 C14a. demethylase (AC002329) : cDNA encoding Arabidopsis 
putative obtusifoliol C14a-demethylase; P- Sc. Gall: a 
promoter from Gall- galactokinase of Saccharomyces 
cerevisiaG to direct expression with galactose induction; 
LacZ-alpha: partial coding sequence for beta-d-galactosidase 

25 or lacZ protein; Ori-pUC: minimum sequence required for a 
functional origin of replication, sequence downstream of 
this region is known to affect copy number when expressed in 
bacteria; AMP: contains the P3 promoter and the beta- 
lactamase coding sequence, conferring resistance to 

30 ampicillin, penicillin, and carbenicillin; Sc.2micron: 2 
micron origin of ■ replication. 

Fig. 3 0 is a map showing the structure of constrxict 
pMON43843. pMON43a43 is a yeast expression vector carrying 
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cDNA encoding Airabidopsis putative squalene epoxidase 1 
(ATA5062 63) in sense orientation driven by the p423Gall 
promoter. Sc.His3: HISS region from Saccharomyces 
cerevlslae encoding imidazoleglycerol -phosphate dehydratase 
5 for histidine synthesis; Ori-f 1 : bacteriophage fl origin of 
replication; LAC: contains partial lad coding sequence, 
promoter Plac, promoter Pt7, promoter Pt3, KS polylinker, 
and partial lacZ coding sequence; lacZ: partial coding 
sequence for beta-d-galactosidase or lacZ protein; T- 

10 Sc.Cycl: a terminator from Cycl- iso-1- cytochrome c from 
Saccharomyces cerevisiae to terminates transcription; 
Squalene epoxidase 1 (ATA506263): cDNA encoding Arabidopsis 
putative squalene epoxidase 1 (ATA506263); P-Sc . Gall a 
promoter from Gall- galactokinase of Sacchajromyces 

15 cGjrGvisiae to direct expression with galactose induction; 

LacZ-alpha: partial coding sequence for beta~d-galactosidase 
or lacZ protein; Ori-pUC: minimum sequence required for a 
functional origin of replication, sequence downstream jof 
this region is known to affect copy number when expres-sed in 

20 bacteria; AMP: contains the P3 promoter and the beta- 
lactamase coding sequence, conferring resistance to 
ampicillin, penicillin, and carbenicillin; Sc.2micron: 2 
micron origin of replication . 

Fig. 31 is a map showing the structure of construct 

25 pMON43844. pMON43844 is a yeast expression vector carrying 
cDNA encoding Arabidopsis putative squalene epoxidase 
1(ATA304243) in sense orientation driven by the p423Gall 
promoter. Sc.His3: HIS3 region from Saccharomyces 
cejTGvisiae encoding imidazoleglycerol -phosphate dehydratase 

30 for histidine synthesis; Ori-fl: bacteriophage fl origin of 
replication; LAC: contains partial lad coding sequence, 
promoter Plac, promoter Pt7, promoter Pt3, KS polylinker, 
and partial lacZ coding sequence; lacZ: partial coding 
sequence for beta-d-galactosidase or lacZ protein; T- 
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Sc.Cycl: a terminator from Cycl- iso-1 -cytochrome c from 
Saccharowyces cerevisiae to terminate transcription; Arab, 
squalene epoxidase 1 (ATA3 04243) : cDNA encoding Arabidopsis 
putative squalene epoxidase 1 (ATA304243) ; P-Sc.Gall: a 
5 promoter from Gall- galactokinase of Saccharomyces 

cerevisiae to direct expression with galactose induction; 
LacZ-alpha: partial coding sequence for beta-d-galactosidase 
or lacZ protein; Ori-pUC: minimum sequence required for a 
functional origin of replication, sequence downstream of 

10 this region is known to affect copy number when expressed in 
bacteria; AMP: contains the P3 promoter and the beta- 
lactamase coding sequence, conferring resistance to 
ampicillin, penicillin, and carbenicillin; Sc.2micron: 2 
micron origin of replication. 

15 Fig. 32 is a comparision of known HMG CoA reductase 

amino acid sequences. ClustalW alignment of forty- three 
non- redundant HMG- CoA reductase sequences to represent 
archaebacterial , eubacterial, fungal, plant and animal 
groups. The putative functional domains in the alignment 

20 marked as described below are based on the three 

dimnensional structure of Pseudomonas mevalonli HMGR 
(Lawrence et al . , 1995, Science 268:1758): boxed-HMGCoA 
binding domain, light shade-NAD{H) binding domain, 
underlined consensus- domains involved in catalysis, * 

25 underneath consensus and boldface -key histidine residue 

involved in catalysis. The putative phosphorylation site 
residues are marked with t and boldface, and are located at 
the C-terminal region of the protein, adjacent to a highly 
conserved arginine, marked with t and boldface. Also 

30 indicated are the conserved Glu (E) , Lys (K) , and Asp (D) 
residues, marked by E, K, and D, respectively. These 
residues are thought to be critical in catalysis, based on 
the crystal structure (Tabeamero et al . , 1999; PNAS 
96(13) :7167-71) . 
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Appendices A through C show SEQ ID Nos : 1 through 3, 
respectively. Appendices D through G show SEQ ID Nos 20 
thorough 22, respectively. 

5 DETAILED DESCRIPTION 

The following detailed description is provided to aid 
those skilled in the art in practicing the present 
invention. Even so, this detailed description should not be 
construed to unduly limit the present invention as 

10 modifications and variations in the embodiments discussed 
herein can be made by those of ordinary skill in the art 
without departing from the spirit or scope of the present 
inventive discovery. 

All publications, patents, patent applications, 

15 databases and other references cited in this application are 
herein incorporated by reference in their entirety as if 
each individual publication, patent, patent application, 
database or other reference were specifically and 
individually indicated to be incorporated by references 

20 We have expressed the full-length forms of the rubber 

and Arabidopsis HMGRs driven by seed- specif ic promoters in 
transgenic canola and soybean. We have demonstrated sterol 
over-production up to 2-4 fold higher in seeds from these 
transgenic plants. We also demonstrated a higher 

25 accumulation of pathway intermediates in soybean than 

canola. These results were disclosed in PCT publication WO 
00/61771. However, we have expressed a truncated form of 
the Arabidopsis hmgl without the linker and membrane 
spanning domains in Arabidopsis and soybean. The results in 

30 Arabidopsis were similar to that demonstrated by Gonzalez et 
al. (1997) and we compared the sterol profiles of our 
transgenic plants with those produced by Gonzalez et al . , 
using our methods to show they are comparable. We found the 
same types of pathway intermediates accumulating. However, 
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in soybean seeds we have demonstrated the accumulation of 
squalene to a very high level (-3mg/g seed which is around 
100-fold higher than in nontransgenic controls) . This is an 
unexpected result not disclosed or suggested in the prior 
5 art. Squalene is a precursor for sterols and in soybean it 
appears that there is a '"bottleneck" in the further 
conversion of this precursor to sterols. Thus, it appears 
that there could be additional ways of over-producing 
sterols in soybean to levels greater than 10 -fold which 

10 would include combining a truncated form of HMGR with other 
genes coding for enzymes down- stream of squalene. 

This opens the potential to combine other genes such as 
squalene epoxidase for further enhancing the levels of 
desirable sterols. Such a combination has not been 

15 disclosed or suggested in the prior art. Squalene expoxidase 
catalyzes the addition of oxygen to squalene which is a 3 0- 
carbon linear isoprenoid chain thus allowing for cyclization 
to form cycloartenol . Additional enzymes downstream that can 
be also be manipulated are sterol methyl transferase 1, C-4 

20 demethylase, C-14 demethylase, sterol methytransf erase 2, 
and C-5 desaturase that would all deplete other pathway 
intermediates shown to accumulate in soybeans- By using such 
strategies it is possible to convert all of the squalene and 
other intermediates to end sterols such as sitosterol, 

25 stigmasterol and campesterol . Thus, sterol level in soybean 
oils can be elevated from 0.3% up to 3.5%. Expression of 
the full-length rubber HMGR in soybeans results in a sterol 
level increase up to 2.7%. 

Enhancement of 3 -hydroxy-3 -methylglutaryl-CoA reductase 

30 (HMG Co-A reductase) activity in certain cells results in 
increased sterol biosynthesis. See, e.g. Chappell, U.S. 
Patent No. 5,58 9,619. The present discovery further 
contemplates an increase of steroid pathway end products 
such as A5 sterols and their stanol counterparts with a 
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decreased accumulation of certain steroid pathway 
intermediates by also enhancing various specific steroid 
pathway enzyme activities, such that more of the steroid 
pathway intermediate compounds are converted to steroid 
5 pathway end products. 

DNA sequences encoding squalene epoxidases are useful 
for removal of squalene accumulation, genes encoding sterol 
methyl transferase I enzymes are useful for removal of 
cylcoartenol accumulation, genes encoding sterol C4- 

10 demethylase are useful for removal of 24 -methylene 

cycloartenol accumulation, genes encoding obtusifoliol C14a- 
demethylases are useful for removal of accumulation of 
obtusifoliol, genes encoding sterol C5-desaturases are 
useful for removal of stigmasta~7-enol accumulation, and 

15 genes encoding sterol methyl transferase II enzymes are " 
useful for the reduction of accumulated campesterol and 
concomitant increase of sitosterol . 

Levels of sitostanol and sitostanol esters can be 
elevated further by approximately 2- to 40-fold over the 

20 transgenic plants of the art having only added genes for HMG 
CoA reductase by introducing additional genes encoding one 
or more of the following sterol pathway enzymes: a. squalene 
epoxidase, a sterol methyl transferase I, a sterol C4- 
demethylase, an obtusifoliol C14a-demethylase, a sterol C5- 

25 . desaturase, a sterol methyl transferase II. 

As used herein, the term "structural coding sequence" 
means a DNA sequence which encodes for a peptide, 
polypeptide, or protein which may be made by a cell 
following transcription of the DNA to mRNA, followed by 

30 translation, to the desired peptide, polypeptide, or protein. 

The term "sterol" as applied to plants refers to any 
chiral tetracyclic isopentenoid which may be formed by 
cyclization of squalene oxide through the transition state 
possessing stereochemistry similar to the trans -syn- trans - 
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anti- trans -anti configuration, i.e., protosteroid cation, 
and which retains a polar group at C-3 (hydroxyl or keto) , 
an all - trans -anti stereochemistry in the ring system, and a 
side-chain 20R-conf iguration (Parker et al . (1992) In Nes 
5 et al . , Eds., Regulation of I sopentenoid Metabolism, ACS 

Symposium Series No. 497, p. 110; American Chemical Society, 
Washington, D.C.). The numbering of the carbon atoms of a 
representative sterol (cholesterol) is shown in the 
following structure (FORMULA II) : 

10 As used herein, the term "sterol" refers to unsaturated 

hydroxyl group- containing derivatives of a fused, reduced 
ring system, cyclopenta [a] -phenanthrene, comprising three 
fused cyclohexane rings (A, B and C) in a phenanthrene 
arrangement, and a terminal cyclopentane ring (D) . The 

15 exemplary steroid below (FORMUIiA II) illustrates the 

numbering system employed herein in describing the location 
of groups and sxibstituents . 




20 Sterols may or may not contain a C-5 to C-6 double 

bond, as this is a feature introduced late in the 
biosynthetic pathway (note Scheme 1, below) . Sterols 
contain a Cg-Cio side chain at the C-17 position, as shown 
above . 

25 The term "phytosterol , " which applies to sterols found 

uniquely in plants, refers to a sterol containing a C-5, and 
in some cases a C-22, double bond. Phytosterols are further 
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characterized by alkylation of the C-17 side-chain with a 
methyl or ethyl substituent at the C-24 position. Major 
phytosterols include, but are not limited to, sitosterol, 
stigmasterol , campesterol, brassicasterol , etc - 
5 Cholesterol, which lacks a C-24 methyl or ethyl side chain,, 
is found in plants but is not unique thereto, and is not a 
"phytosterol " 

"Phytostanols" are saturated forms of phytosterols 
wherein the C-5 and, when present, C-22 double bond(s) 
10 is (are) reduced, and include, . but are not limited to, 
sitostanol, campestanol, and 22-dihydrobrassicastanol . 

"Phytosterol esters" and "phytostanol esters" are 
further characterized by the presence of a fatty acid or 
phenolic acid moiety rather than a hydroxyl group at the C-3 
15 position. 

The term "steroid compounds" includes sterols, 
phytosterols, phytosterol esters, phytostanols, and 
phytostanol esters . 

The term "phytosterol compound" refers to at least one 
20 phytosterol, at least one phytosterol ester, or a mixture 
thereof . 

The term "phytostanol compound" refers to at least one 
phytostanol, at least one phytostanol ester, or a mixture 
thereof . 

25 The term "constitutive promoter" refers to a promoter 

that operates continuously in a cell, and which is not 
subject to quantitative regulation. The gene with which 
such a promoter is associated is always "turned on. " 

The terms "seed-specific," "fruit-specific," "plastid- 

30 specific," etc., as they apply to promoters refer to 

preferential or exclusive activity of these promoters in 
these organs or organelles, respectively. "Preferential 
expression" refers to promoter activity greater in the 
indicated organs or organelles than elsewhere in the plant . 
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"Seed^specif ic" comprehends expression in the aleurone 
layer, endosperm, and/or embryo of the seed. 

As used herein ^'isolated polynucleotide" means a 
polynucleotide that is free of one or both of the nucleotide 
5 sequences which flank the polynucleotide in the naturally- 
occurring genome of the organism from which the 
polynucleotide is derived- The term includes, for example, 
a polynucleotide or fragment thereof that is incorporated 
into a vector or expression cassette; into an autonomously 

10 replicating plasmid or virus; into the genomic DNA of a 
prokaryote or eukaryote; or that exists as a separate 
molecule independent of other polynucleotides. It also 
includes a recombinant polynuclefotide that is part of a 
hybrid polynucleotide, for example, one encoding a 

15 polypeptide sequence. 

As used herein ^^polynucleotide" and ^^oligonucleotide" 
are used interchangeably and refer to a polymeric (2 or 
more monomers) form of nucleotides of any length, either 
ribonucleotides or deoxyribonucleotides . Although 

20 nucleotides are usually joined by phosphodiester linkages, 
the term also includes polymeric nucleotides containing 
neutral amide backbone linkages composed of aminoethyl 
glycine units. This term refers only to the primary 
structure of the molecule. Thus, this term includes double- 

25 and single -stranded DNA and RNA. It also includes known 
types of modifications, for example, labels, methylation, 
"caps", substitution of one or more of the naturally 
occurring nucleotides with an analog, internucleot ide 
modifications such as, for example, those with uncharged 

30 linkages (e.g., methyl phosphonates , phosphotriesters , 

phosphoamidates, carbamates, etc.), those containing pendant 
moieties, such as, for example, proteins (including for 
e.g., nucleases, toxins, antibodies, signal peptides, poly- 
L-lysine, etc.), those with intercalators (e.g., acridine. 
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psoralen, etc.)/ those containing chelators (e.g., metals, 
radioactive metals, boron, oxidative metals, etc.), those 
containing alkylators , those with modified linkages (e.g., 
alpha anomeric nucleic acids, etc.), as well as unmodified 
5 forms of the polynucleotide. Polynucleotides include both 
sense and antisense strands. 

The alternative nucleotide sequences described above 
are considered to possess substantially similar enzymatic 
activity to that of the polypeptide -encoding polynucleotide 

10 sequences of the present invention if they encode 

polypeptides having enzymatic activity differing from that 
of any of the polypeptides encoded by the polynucleotide 
sequences of the present invention by about 30% or less, 
preferably by about 2 0% or less, and more preferably by 

15 about 10% or less when assayed by standard enzymatic assays. 
As used herein "effective amount" is intended to 
qualify the amount of an agent which will achieve the goal 
of a lessening in the severity and/ or the frequency of, 
incidence of a disease condition or disorder, over no ri 

20 treatment . 

The phrase steroid pathway products" refers to the 
products of steroid biosynthesis produced by the action of 
one or more of squalene epoxidase enzyme, sterol methyl 
transferase I enzyme, sterol C4-demethylase enzyme, 

25 obtusifoliol C14a-demethylase enzyme, sterol C5-desaturase 
enzyme, and sterol methyl transferase II enzyme. Specific 
examples of steroid pathway products include, but are not 
limited to, sitosterol, sitostanol, stigmasterol and 
stigmastanol . 

30 In the context of the present disclosure, a ^'non- 

transformed" plant or cell refers to a plant or cells which 
does not comprise introduced polynucleotides encoding a 
polypeptide having 3 -hydroxy- 3 -methyulglutaryl -Coenzyme A 
reductase enzyme activity and at least one polypeptide 
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having squalene epoxidase enzyme activity, sterol methyl 
transferase I enzyme activity, sterol C4-demethylase enzyme 
activity, obtusifoliol C14a-demethylase enzyme activity, 
sterol C5-desaturase enzyme activity, or sterol methyl 
5 transferase II enzyme activity. Thus, a plant or cell that 
contains introduced polynucleotide sequences other than 
those above, would still be considered ^^non-transf ormed. " 

As used herein, ""peptide" and ""protein" are used 
interchangeably and mean a compound that consists of two or 
10 more amino acids that are linked by means of peptide bonds. 



I . Plant Steroid Biosynthesis 

To aid the reader in understanding the present 
invention, descriptions of the sterol compound biosynthetic 

15 pathway are presented below. These descriptions identify 
enzymes useful in achieving the modifications to the 
biosynthesis and accumulation of sterol compounds described 
herein, and identify sources of nucleic acid sequences 
encoding these enzymes. 

20 Various steps in the steroid compound biosynthetic 

pathway in plants are shown in Scheme 1, below. The numbers 
over the arrows refer to plant sterol compound biosynthetic 
pathway enzymes and genes as indicated in Table 1 . 



Q2061072A2 t > 



wo 02/061072 



PCT/US02/00255 




wo 02/061072 



PCT/US02/00255 



38 



Table 1 ' 

Plant Sterol Compound Pathway Enzymes and Genes 



Enzyme 



Step in 
Pathway 



GenBank 
Gene ID 



Acetoacetyl-CoA thiolase 
HMG-CoA synthase 
HMG-CoA reductase 

Mevalonate kinase 

Pho sphome va 1 ona t e ki na s e 

Mevalonate pyrophosphate 
decarboxylase 

Isopentenyl diphosphate 
isomerase 

Farne syl pyrophosphate 
synthase 

Squalene synthase 

Squal ene epoxi das e 

Squalene cyclase 
Sterol C-24 
methyl transferase 

Sterol C-4 demethylase 

Cycloeucalenol - 
obtus i f ol iol i somerase 

Sterol C-14 demethylase 
Sterol C-14 reductase 



1 
2 
3 

4 
5 



8 



10 

11 

12, 18 

13, 19 

14 

15 
16 



X78116 
X83882 
X15032 
L19262 
X77793 
Not 

available 
Y14325 

U49259 
U47324 
X75789 

AF0045 
60 

AB0168 
83 

U87266 
U71400 

Not 

available 
Not 

available 
U74319 
PCT WO 



02061072A2 I > 



wo 02/0614172 



PCT/US02/00255 



39 



Sterol C-8 isomerase 

Sterol C-5 desaturase 
Sterol C-7 reductase 
Sterol C-24 isomerase 



Sterol C-24 reductase 



Sterol C-22 desaturase 



Sterol C-5 reductase 



97/48793 
17 AF0303 
57 

20 X90454 

21 U49398 

22 . Klahre 

et al . 
(1998) 
Plant Cell 
10: 1677- 
1690 

23 Same 
as 22 

24 Not 
available 

25 WO 
00/61771 



10 



15 



The plant sterol compound biosynthesis pathway has two 
distinct components. The early pathway reactions, leading 
from acetyl -CoA to squalene via mevalonic acid, are common 
to other isoprenoids. The later pathway reactions, leading 
from squalene to the major plant sterol compounds such as 
sitosterol, campesterol and stigmasterol , are committed 
biosynthetic reactions. 

The early pathway reactions have been studied in fungi 
and plants (Lees et al . , Biochemlsizry and Function of 
Sterols, Nes and Parish, Eds., CRC Press, 85-99 (1997); 
Newman and Chappell, Biochemistjtry and Function of Sterols, 
Nes and Parish, Eds., CRC Press, 123-134 (1997); Bach et 
al.. Biochemistry and Function of Sterols, Nes and Parish, 
Eds., CRC Press, 135-150 (1997)). 
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Acetoacetyl CoA thiolase (EC 2.3.1,9) catalyzes the 
first reported reaction, which consists of the formation of 
acetoacetyl CoA from two molecules of acetyl CoA (Dixon et 
al., J. Steroid BiochGm. Mol . Biol. 62: 165-171 (1997)). 
5 This enzyme has been purified from radish. A radish cDNA 
has been isolated by functional complementation in 
Sacchaxomyces cerevisiae (GeneBank Accession # X78116) . A 
radish cDNA has also been screened against a cDNA library of 
Arabidopsis thaliana (Vollack arid Bach, Plant Physiology 

10 111: 1097-1107 (1996)). 

HMGCoA ■ synthase (EC 4 . 1 • 3 . 5) catalyzes the production 
of HMGCoA. This reaction condenses acetyl CoA with 
acetoacetyl CoA to yield HMGCoA.' HMGCoA synthase has been 
purified from yeast. A plant HMGCoA synthase cDNA has also 

15 been isolated from Arabidopsis thaliana (Montamat et al . , 
Gene 167: 197-201 (1995)). 

HMGCoA reductase, also referred to as 3 -hydroxy- 3 - 
methyglutaryl -coenzyme A (EC 1.1.1.34), catalyzes the 
reductive conversion of HMGCoA to mevalonic acid (MVA) . 

20 This reaction is reported to play a role in controlling 

plant isoprenoid biosynthesis (Gray, Adv. Bat. Res. 14: 25- 
91 (1987); Bach et al . , Lipids 26 1 637-648 (1991); Stermer 
et al., J-. Lipid Res, 35: 1133-1140 (1994). Plant HMGCoA 
reductase genes are often encoded by multigene families. 

25 The number of genes comprising each multigene family varies, 
depending on the species, ranging from two in Arabidopsis 
thaliana to at least seven in potato. Overexpression of 
plant HMGCoA reductase genes in transgenic tobacco plants 
has been reported to result in the overproduction of 

30 phytosterols (Schaller et al . , Plant Physiol. 109: 761-770 
(1995) ) . 

Mevalonate kinase (EC 2.7.1.36) catalyzes the 
phosphorylation of mevalonate to produce mevalonate 5- 
phosphate. It has been reported that mevalonate kinase 
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plays a role in the control of isoprenoid biosynthesis 
(Lalitha et al . , Indian. J. Biochem. Biophys. 23: 249-253 
(1986)). A mevalonate kinase gene from Ajrabidopsis thaliana 
has been cloned (GeneBank accession number X77793; Riou et 
5 al.. Gene 148: 293-297 (1994)). 

Phosphomevalonate kinase (EC 2.7.4.2) (MVAP kinase) is 
an enzyme associated with isoprene and ergosterol 
biosynthesis that converts mevalonate -5 -phosphate to 
mevalonate -5 -pyrophosphate utilizing ATP (Tsay et al . , Mol . 
10 Cell. Biol. 11: 620-631 (1991)). 

Mevalonate pyrophosphate decarboxylase ("MVAPP 
decarboxylase") (EC 4.1.1.33) catalyzes the conversion of 
mevalonate pyrophosphate to isopentenyl diphosphate ("IPP"). 
The reaction is reported to be a decarboxyl at ion/dehydration 

15 reaction which hydrolyzes ATP and requires Mg2+. a cDNA " 
encoding Arabidapsis thaliana MVAPP decarboxylase has been 
isolated (Toth et al • , J". Biol. Chem. 271: 7895-7898 
(1996) ) . An isolated Arabidopsis thaliana MVAPP 
decarboxylase gene was reported to be able to complement the 

20 yeast MVAPP decarboxylase . 

Isopentenyl diphosphate isomerase ( "IPP:DMAPP" ) (EC 
5.3.3.2) catalyzes the formation of dimethylallyl 
pyrophosphate (DMAPP) from isopentenyl pyrophosphate (IPP) . 
Plant IPP: DMAPP isomerase gene sequences have been reported 

25 for this enzyme. It has also been reported that IPP: DMAPP 
isomerase is involved in rubber biosynthesis in a latex 
extract from Hevea (Tangpakdee et al . , Phytochexnistry 45: 
261-267 (1997) . 

Farnesyl pyrophosphate synthase (EC 2.5.1.1) is a 

30 prenyl transferase which has been reported to play a role in 
providing polyisoprenoids for sterol compound biosynthesis 
as well as a number of other pathways (Li et al . , Gene 17: 
193-196 (1996) ) • Farnesyl pyrophosphate synthase combines 
DMAPP with IPP to yield geranyl pyrophosphate ("GPP") . The 
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same enzyme condenses GPP with a second molecule of IPP to 
produce farnesyl pyrophosphate ("FPP")- FPP is a molecule 
that can proceed down the pathway to sterol compound 
synthesis, or that can be shuttled through other pathways 
5 leading to the synthesis of cfuinones or sesquiterpenes. 

Squalene synthase (EC 2.5.1.21) reductively condenses 
two molecules of FPP in the presence of Mg2+ and NADPH to 
form scfualene. The reaction involves a head- to-head 
condensation, and forms a stable intermediate, presqualene 

10 diphosphate. The enzyme is subject to sterol demand 
regulation similar to that of HMGCoA reductase. The 
activity of squalene synthase has been reported to have a 
regulatory effect on the incorporation of FPP into sterol 
and other compounds for which it serves as a precursor 

15 (Devarenne et al . , Arch. Biochem. Biophys . 349: 205-215 
(1998) ) . 

Squalene epoxidase (EC 1.14.99.7) (also called squalene 
monooxygenase) catalyzes the conversion of squalene to 
squalene epoxide (2 , 3-oxidosqualene) , a precursor to the 

20 initial sterol molecule in the sterol compound biosynthetic 
pathway, cycloartenol . This is the first reported step in 
the pathway where oxygen is required for activity. The 
formation of squalene epoxide is also the last common 
reported step in sterol biosynthesis of animals, fungi, and 

25 plants. 

The later pathway of sterol compound biosynthetic steps 
starts with the cyclization of squalene epoxide and ends 
with the formation of 5 -24-alkyl sterols in plants. 

2 , 3 -oxidosqualene cycloartenol cyclase (EC 5.4.99-8) 
30 (also called cycloartenol synthase) is the first step in the 
sterol compound pathway that is plant-specific. The 
cyclization of 2 , 3-oxidosqualene leads to lanosterol in 
animals and fungi, while in plants the product is 
cycloartenol. Cycloartenol contains a 9 , 19-cyclopropyl 
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ring. The cyclization is reported to proceed from the epoxy 
end in a chair-boat -chair-boat sequence that is mediated by 
a transient C-2 0 carbocationic intermediate. 

S-adenosyl-L-methionine : sterol C-24 methyl transferase 
5 ("SMTl") (EC 2.1.1.41) catalyzes the transfer of a methyl 

group from S-adenosyl-L-methionine to the C-24 center of the 
sterol side chain (Nes et al . (1991) J. Biol. Chem. 
266 (23) :15202-15212) . This is the first of two methyl 
transfer reactions that have been reported to be an 
10 obligatory and rate-limiting step of the sterol compound- 
producing pathway in plants. The second methyl transfer 
reaction occurs later in the pathway after the A®"'' 
isomerase. The enzyme responsible for the second methyl • 
transfer reaction is named SMTII (Bouvier-Nave , P. et al . , 
15 (1997) Eur. J. Biochem. , 246: 518-529). An isoform, SMTII, 
catalyzes the conversion of cycloartenol to a A"^^*^ -24-alkyl 
sterol, cyclosadol • (Guo et al . (1996) Tetrahed. iett. 
37 (38) :6823-6826) . 

Sterol C-4 demethylase catalyzes the first of several 
20 demethylation reactions, which results in the removal of the 
two methyl groups at C-4. While in animals and fiongi the 
removal of the two C-4 methyl groups occurs consecutively, 
in plants it has been reported that there are other steps 
between the first and second C-4 demethylations . The C-4 
25 demethylation is catalyzed by a complex of microsomal 

enzymes consisting of a monooxygenase , an NAD"^ -dependent 
sterol 4 -decarboxylase, and an NADPH- dependent 3-ketosteroid 
reductase . 

Cycloeucalenol -obtusif oliol isomerase ( "COI " ) catalyzes 
30 the opening of the cyclopropyl ring at C-9. The opening of 
the cyclopropyl ring at C-9 creates a double bond at C-8. 

Sterol C-14 demethylase catalyzes demethylation at C- 
14, which removes the methyl group at C-14 and creates a 
double bond at that position. In both fungi and animals, 
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this is the first step in the sterol synthesis pathway. 
Sterol 14 -demethylation is mediated by a cytochrome P-450 
complex. 

Sterol C-14 reductase catalyzes a C-14 demethylation 
5 that results in the formation of a double bond at C-14 

(Ellis et al.. Gen. Microbiol. 137: 2627-2630 (1991)). This 
double bond is removed by a A^^ reductase . The normal 
substrate is 4 -methyl -8 , 14 , 24 (24^) -trien-3P-ol . NADPH is 
the normal reductant . 
10 Sterol C-8 isomerase catalyzes a reaction that involves 

further modification of the tetracyclic rings or the side 
chain (Duratti et al . , BioctiGin. Pharmacal . 34: 2765-2777 
(1985) ) . The kinetics of the sterol isomerase-catalyzed 
reaction favor a A® 6 isomerase reaction that produces a 
15 A'' group. 

Sterol C-5 desaturase catalyzes the insertion of the A^- 
double bond that normally occurs at the A''-sterol level, 
thereby forming a A^''^-sterol (Parks et al . , Lipids 30: 227- 
23 0 (1995) ) . The reaction has been reported to involve the 

20 stereospecif ic removal of the 5a and 6a hydrogen atoms, 
biosynthetically derived from the 4 pro-R and 5 pro-S 
hydrogens of the (+) and (-) R-mevalonic acid, respectively. 
The reaction is obligatorily aerobic, and requires NADPH or 
NADH. The desaturase has been reported to be a multienzyme 

25 complex present in microsomes. It consists of the 

desaturase itself, cytochrome b5^ and a pyridine nucleotide- 

dependent f lavoprotein. The A^-desaturase is reported to be 
a mono -oxygenase that utilizes .electrons derived from a 
reduced pyridine nucleotide via cytochrome 

30 Sterol C-7 reductase catalyzes the reduction of a A"'- 

double bond in A^'''-sterols to generate the corresponding A^- 
sterol- It has been reported that the mechanism involves. 
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like many other sterol enzymes, the formation of a 
carbocat ionic intermediate via electrophilic "attack" by a 
proton. 

Sterol C-24(28) isomerase catalyzes the reduction of a 
5 A^**^°^-A^^, a conversion that modifies the side chain. The 
product is a A^*^^^^ -24 -alkyl sterol. Sterol C-24 reductase 
catalyzes the reduction of the ^^^^^^ double bond at C-24, 
which produces sitosterol. Recently, Klahre et al . ((1998) 
Plant Cell 10:1677-1690) discovered that both the 

10 isomerization and reduction steps are catalyzed by an enzyme 
coded by the same gene , i.e., DIM/DWFl . 

Sterol C-22 desaturase (EC 2. 7. 3.. 9) catalyzes the 
formation of a double bond at C-22 on the side chain. This 
formation of a double bond at C-22 on the side chain marks 

15 the end of the sterol compound biosynthetic pathway, and 

results in the formation of stigmasterol (Benveniste (1986) 
Annu. Rev. Plant Physiol. 37:275-308). The C-22 desaturase 
in yeast, which is the reported final step in the 
biosynthesis of ergosterol in that organism, requires NADPH 

20 and molecular oxygen. In addition, the reaction is also 

reported to involve a cytochrome P450 that is distinct from 
a cytochrome P450 participating in demethylation reactions 
(Lees et al • (1995) Lipids 30: 221-226). 

Phytosterols are biogenetic precursors of 

25 brassinosteroids , steroid alkaloids, steroid sapogenins, 

ecdysteroids, and steroid hormones. This precursor role of 
phytosterols is often described as a "metabolic" function. 
A common transformation of free sterols in tissues of 
vascular plants is the conjugation at the 3 -hydroxy group of 

30 sterols with long-chain fatty acids to form steryl esters, 
or with a sugar, usually with a single molecule of p-D- 
glucose, to foarm steryl glycosides. Some of the steryl 
glycosides are additionally esterified, at the 6 -hydroxy 
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group of the sugar moiety, with long-chain fatty acids to 
form acylated steryl glycosides. 

The existence of several enzymes that are specifically 
associated with the synthesis and breakdown of conjugated 
5 sterols has been reported (Wojciechowski, Physiology and 
Biochemistry of Sterols, eds. Patterson, Nes, AOCS Press, 
361 (1991))- Enzymes involved in this process include: 
UDPGlc : Sterol glucosyltransf erase , phospho (galacto) glyceride 
steryl glucoside acyl transferase, and sterylglycoside and 

10 sterylester hydrolases. 

UDPGlc : sterol glucosyltransf erase (EC 2.4.1.173) 
catalyzes glucosylation of phytosterols by glucose transfer 
from UDP-glucose ("UDPGl") . The formation of steryl 
glycosides can be measured using UDP- [^^C] glucose as the 

15 substrate- Despite certain differences in their specificity 
patterns, all reported UDPGlc : sterol glucosyltransf erases 
preferentially glucosylate only sterols or sterol -like 
molecules that contain a C-3 hydroxy group, a (3- 
conf iguration, and which exhibit a planar ring. It has been 

20 reported that UDPGlc : sterol glucosyltransf erases are 
localized in the microsomes. 

Phospho (galacto) glyceride steryl glucoside 
acyltransf erase catalyzes the formation of acylated steryl 
glycosides from the siibstrate steryl glycoside by transfer 

25 of acyl groups from some membranous polar acyl lipids to 
steryl glycoside molecules . 

Acylglycerol : sterol acyltransf erase (EC 2.3.1.26) 
catalyzes the reaction wherein certain acylglycerols act as 
acyl donors in a phytosterol esterif ication . In plants, the 

30 activity of acylglycerol : sterol acyltransf erase is reported 
to be associated with membranous fractions. A pronounced 
specificity for shorter chain unsaturated fatty acids was 
reported for all acyltransf erase preparations studied in 
plants. For example, acylglycerol : sterol acyltransf erases 
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from spinach leaves and mustard roots can esterify a number 
of phytosterols . 

Sterylglycoside and sterylester hydrolases ("SG- 
hydrolases") catalyze the enzymatic hydrolysis of 

5 sterylglycosides to form free sterols. The SG-hydrolase 
activity is not found in mature, ungerminatecj seeds, is 
reported to emerge only after the third day of gemination, 
and is found mainly in the cotyledons . It has been reported 
that phospho (galacto) glyceride : SG acyltranaf erase may 

0 catalyze a reversible reaction. Enzymatic hydrolysis of 
sterylesters in germinating seeds of mustard, barley and 
corn is reported to be low in dormant seeds, but increases 
during the first ten days of germination. This activity is 
consistent with a decrease in sterylesters and an increase 

5 in free sterols over the same temporal period. 

II. Processes for Modifying Steroid Compound 
Biosynthesis and Accumulation 

In order to obtain seed producing oil containing 
3 elevated levels of phytostanols and phytostanol esters such 
as sitostanol and sitostanol esters, these recombinant 
constructs or expression cassettes can be introduced into 
plant cells by any number of conventional means known in the 
art and regenerated into fertile transgenic plants. The 
5 genome of such plants can then comprise introduced DNA 
encoding various steroid pathway enzymes, alone or in 
combination, that achieves the desirable effect of enhancing 
the levels of phytostanols, phytostanol esters, mixtures 
thereof in the oil of seed thereof. 
) Preferably, the genome can comprise introduced DNA 

encoding a HMG CoA reductase enzyme and an introduced DNA 
encoding one or more of a squalene epoxidase, a sterol 
methyl transferase I, a sterol C4-demethylase, an 
obtusifoliol C14a-demethylase, a sterol C5-desaturase, a 
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Sterol methyl transferase II. In each case^ the foregoing 
introduced DNAs can be operatively linked to regulatory 
signals that cause seed-specific expression thereof. 
The present invention encompasses not only such 
5 transgenic plants, but also transformed plant cells, 

including cells and seed of such plants, as well as progeny 
of such plants, for example produced from the seed. 
Transformed plant cells and cells of the transgenic plants 
encompassed herein can be grown in culture for a time and 

10 under appropriate conditions to produce oil containing 
elevated levels of phytosterols and/or phytostanols and 
their corresponding esters. Alternatively, the 
phytosterols, phytostanols, and their corresponding esters 
can be isolated directly from the cultures. 

15 In addition, of course, seed obtained from the 

transgenic, progeny, hybrid, etc., plants disclosed herein 
can be used in methods for obtaining oil containing 
phytosterols, phytosterol esters, phytostanols, phytostanol 
esters, or mixtures thereof employing extraction and 

20 processing procedures known in the art. Note, in this 
regard, Kochhar (1983) Prog. Lipid Res. 22: 161-188. 

The present invention also encompasses a method of 
producing a plant that accumulates an elevated level of 
sitosterol, at least one sitosterol ester, sitostanol, at 

25 least one sitostanol ester, or mixtures thereof, in seeds 
thereof compared to seeds of a corresponding plant 
comprising no introduced DNA encoding a polypeptide or 
protein that affects the biosynthesis of sterols, 
phytosterols, phytosterol esters, phytostanols, phytostanol 

30 esters, or combinations thereof, comprising sexually 

crossing a transgenic plant of the present invention with 
such a corresponding plant. The latter can be a non- 
transgenic plant, or a transgenic plant containing 
introduced DNA encoding a trait other than one affecting 
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sterol, phytosterol, etc., biosynthesis. For example, such 
trait may be insect or herbicide resistance. Plants 
produced by this method also form part of the present 
invention. 

5 Also included are plants that accumulate an elevated 

level of sitosterol, at least one sitosterol, ester, 
sitostanol, at least one sitostanol ester, or mixtures 
thereof, in seeds thereof compared to seeds of a 
corresponding plant comprising no introduced DNA encoding a 
10 polypeptide or protein that affects the biosynthesis of 
sterols, phytosterol s, phytosterol esters, phytostanols , 
phytostanol esters, or combinations thereof, which are 
apomictic . 

A process of increasing the formation of steroid 

15 pathway products in a transformed host cell as compared to 
an otherwise identical non- trans formed host cell comprising 
the following steps. A host cell is transformed with a 
recombinant vector comprising (a) as operably linked 
components in the 5' to 3 ' direction, a promoter, a DNA 

20 sequence encoding a polypeptide exhibiting 3-hydroxy-3- 

methylglutaryl- Coenzyme A reductase enzyme activity, and a 
transcription termination signal sequence; and (b) as 
operably linked components in the 5' to 3 ■ direction, a 
promoter, a DNA sequence encoding a steroid pathway enzyme, 

25 and a transcription termination signal sequence. The 

steroid pathway enzyme is a squalene epoxidase enzyme, a 
sterol methyl transferase I enzyme, a sterol C4-demethylase 
enzyme, a obtusifoliol C14a-demethylase enzyme, a sterol C5- 
desaturase enzyme, and a sterol methyl transferase II 

30 enzyme. The transformed plant cell is regenerated into a 
transgenic plant. 

A plant contemplated by this invention is a vascular, 
multicellular higher plant. Such higher plants will 
hereinafter by usually referred to simply as "plants". Such 
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"plants" include both complete entities having leaves, 
stems, seeds, roots and the like as well as callus and cell 
cultures that are monocotyledonous and dicotyledonous. 
Dicotyledonous plants are a preferred embodiment of the 
5 present invention. 

Preferred plants are members of the Solanaceae, 
Leguminosae, Ammiaceae, Brassicaceae, Gramineae, Carduaceae 
and Malvaceae families. Exemplary plant members of those 
families are tobacco, petunia and tomato (Solanaceae) , 
10 soybean and alfalfa (Leguminosae) , carrot (Ammiaceae) , corn, 
maize and barley (Gramineae) , AraJbidopsls (Brassicaceae) , 
guayule (Carduaceae) , and cotton (Malvaceae) . A preferred 
plant is tobacco of the strain. Nlcotiana tabacum {N. 
Tabacum) , cotton of the strain Coker line 312-5A, soybean of 

15 the strain Glycine max, alfalfa of the strain RYSI or tomato 
of the strain LycopGirsicon esculentium. Other plants 
include canola, maize and rape. 

A transgenic plant contemplated by this invention is 
produced by transforming a plant cell or protoplast with an 

20 added, exogenous structural gene that encodes a polypeptide 
having HMG-CoA reductase activity and an exogenous 
structural gene that encodes at least one polypeptide have 
steroid pathway enzyme activity to produce a transformed 
plant cell, and regenerating a transgenic plant form the 

25 transformed plant cell. The encoded polypeptide is 

expressed both in the transformed plant cell or protoplast 
and the resulting transgenic plant. (The phrase "plant 
cell" will hereinafter be used to include a plant 
protoplast, except where plant protoplasts are specifically 

30 discussed) . 

A non- transgenic plant that serves as the source of the 
plant cell that is transformed, i.e. the precursor cell, is 
referred to herein as a "native, non- transgenic" plant. The 
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native, non- transgenic plant is of the same strain as the 
formed transgenic plant. 

Sterol production in a transgenic plant of the present 
invention is increased by increasing the activity of the 
5 enzyme HMG-CoA reductase, which enzyme catalyzes the 

conversion of 3 -hydroxy-3-methylglutaryl Coenzyme A (HMG- 
CoA) to mevalonate and the activity of at least one other 
steroid pathway enzyme. As used herein, the term "specific 
activity" means the activity normalized to cellular protein 
10 content . 

HMG-CoA reductase activity is increased by increasing 
the amount (copy number) of a gene encoding a polypeptide 
having HMG-CoA reductase catalytic activity. Expression of 
the increased amount of that encoded structural gene 

15 enhances the activity of that enzyme. 

The amount of the expressed gene is increased by 
transforming a plant cell with a recombinant DNA molecule 
comprising a vector operatively linked to a DNA segment that 
encodes a polypeptide having HMG-CoA reductase activitiy, 

20 and a promoter suitable for driving the expression of that 
polypeptide in that plant cell, and culturing the 
transformed plant cell into a transgenic plant. Such a 
polypeptide includes intact as well as a catalytically 
active, truncated HMG-CoA reductase proteins. 

25 Thus, a transformed plant cell and a transgenic plant 

have one or more added, exogenous genes that encode a 
polypeptide having HMG-CoA reductase activity and at least 
one other steroid pathway enzyme activty relative to a 
native, non- transgenic plant or untransf ormed plant cell of 

30 the same type. As such, a transformed plant cell or 

transgenic plant can be distinguished from an untransf ormed 
plant cell or native, nontransgenic plant by standard 
technology such as agarose separation of DNA fragments or 
mRNAs followed by transfer and appropriate blotting with DNA 
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or RNA, e.g., Southeim or Northern blotting, or by use of 
polymerase chain reaction technology, as are well known. 
Relative HMG-CoA reductase activity of the transformed cell 
or transgenic plant with untransf ormed cells and native, 
5 non-transgenic plants or cell cultures therefrom can also be 
compared, with a relative activity for that enzyme of about 
1.5:1 for transgenic (transformed) to native (untransf ormed) 
showing transformation. Higher relative activity ratios 
such as about 15:1 have also been observed. 

10 Sterol accumulation can also be used to distinguish 

between native, non- transgenic and transgenic plants. A 
transgenic plant has at least about twice the total sterol 
content as a native, non- transgenic plant where a single 
added gene is present. Greater differences up to about 

15 forty- fold have also been observed. 

Sitostanol, sitostanol ester, and tocopherol 
biosynthesis and accumulation in plants can be modified in 
accordance with the present invention by variously 
expressing the nucleic acid coding sequences discussed 

20 above, alone or in combination, as described herein. The 

expression of sequences encoding sterol methyltransf erase II 
enzymes facilitates the production of plants in which the 
biosynthesis and accumulation of campesterol, campestanol, 
and their esters can be reduced as these enzymes shunt 

25 sterol intermediates away from campesterol, and toward 
sitosterol and sitostanol. 

Ill . DNA Encoding Useful Polypeptides 
The present invention contemplates a recombinant 
30 construct or a recombinant vector that contains a DNA 

sequence encoding a polypeptide exhibiting 3 -hydroxy- 3- 
methylglutaryl- Coenzyme A (HMG-CoA) reductase activity and a 
DNA sequence encoding a polypeptide exhibiting the activity 
of a steroid pathway enzyme. Each polypeptide -encoding DNA 
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sequence is operably linked in the 5* to 3 ' direction 
independent of the other sequence . Each DNA sequence in the 
5' to 3' direction comprises a promoter, then the DNA 
sequence encoding the polypeptide then a transcription 
5 termination signal sequence. The steroid pathway enzyme is 
a squalene epoxidase enzyme, a sterol methyl transferase I 
enzyme, a sterol C4-demethylase enzyme, a obtusifoliol C14a- 
demethylase enzyme, a sterol C5-desaturase enzyme, or a 
sterol methyl transferase II enzyme. It is contemplated 

10 that HMG-CoA reductase and steroid pathway enzyme activities 
come from a mutant or truncated form of those enzymes, such 
as a truncated HMG-CoA reductase lacking the transmembrane 
region while retaining a functional catalytic domain. 
Several HMG CoA reductase sequences are known in the art . 

15 An amino acid alignment for these is shown in FIG. 32. " The 
sources of the sequences used in building the multiple 
alignment are listed in Table 5. 
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Table 5. Sources of Sequences Used In Building The 
Multiple Alignment 



methanobac 

methanococ 

halobacter 

sulfoiobus 

yeast2 

yeasti 

phycomyces 

fusarium 

Candida 

dictyoste2 

wheati 

rice 

com 

wheat3 

wheat2 

soybean 

rubbertreS 

rosyperiwi 

tomato 

woodtobacc 

potato 

radish 

arabadopsisl 

cucumismel 

rubbertre2 

rubbertrel 

camptothec 

arabadops2 

chineseham 

chineseha2 

syrianhamst 

rat 

rabbit 

human 

mouse 

xenopus 

seaurchin 

cockroach 

drosophila 

dictyostel 

schistosom 

archaeoglo 

pseudomonas 



swissprothmdh_metth 

8 wi ssprot: h md h^metja 

swissprothmdh__halvo 

swissprothmdh_sulso 

gpjplnl :yschmgcr2_1 

gp_pln1 :yschmgcr1_1 

swissprothmdh^phybl 

swlssprot:hmdh_fusmo 

gp_pln1 :ab012603_1 

swissprot:hmd2_diodi 

p!r2:pq0761 

swissprothmdh_orysa 

sp_planfco24594 

pir2:pq0763 

pir2:pq0762 

gnnbc6:30820„1r59f1 

swissprothmd3_hevbr 

swissprot:hmdh_catro 

swissprot:hmd2Jyces 

swissprot:hmdh_nicsy 

gp_p!n1 :pothmgri_1 

sp_plant:q43826 

gp_pln1 :athhmgcoar_1 

gp_pln1:ab021862_1 

swissprothmd2__hevbr 

swissprothmd 1_hevbr 

swissprothmdh_camac 

swisspfDt:hmd2_arath 

swissprotrh mdh_crigr 

gp_rod:cruhmg 1 4_1 

gp_rod:hamhmgcob_1 

swissprot:hmdh_rat 

swissprot:hmdh_rabit 

gp_pri2:humhmgcoa_1 

gp_rod :mushnngcoa_1 

swissprot:hmdh_xenla 

swissprot:hmdh_strpu 

swisspFOt:hmd h_blage 

Swiss prothmdh_drome 

swissprothmd1_dicdi 

swjssprothmdh_schma 

swissprot:hmdh_arcfu 

gp_bct1 :psehmgcoa_1 
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026662 methanobacterium thermoautotrophicur 
Q581 16 methanococcus jannaschii. 3-hydroxy-3 
Q59468 halobacterium volcanii (haloferax volcar 
008424 sulfoiobus solfataricus. 3-hydroxy-3-me 
M22255 Saccharomyces cerevisiae Yeast HWiG- 
M22002 Saccharomyces cerevisiae Yeast HMG 
Q12649 phycomyces blaltesleeanus. 3-hydroxy- 
Q 12577 fusarium monilifbrme (gibberella fujikurc 
AB012603 Candida utiKs Candida utilis HMG mF 
P34136 dictyostelium discoideum (slime mold). : 
hydroxymethylglutaryl-CoA reductase (NADPH) 
P48019 oryza sativa (rice). 3-hydroxy-3-methylg 
024594 zea mays (maize). 3-hydroxy-3-methylc 
hydroxymethylglutary)-CoA reductase (NADPH) 
hydroxymethylglutaryl-CoA reductase (NADPIH) 
from proprietary soy sequence database 
Q00583 hevea brasHiensis (para rubber tree). 3- 
Q03163 catharanthus roseus (rosy periwinkle) (i 
P48022 lycopersicon escuientum (tomato). 3-hyi 
Q01559 nicotiana sylvestris (wood tobacco). 3-h 
L01400 Solanum tuberosum Potato hydroxymetl 
Q43826 raphanus sativus (radish), hydroxymeth 
LI 9261 Arabidopsis thallana Arabidopsis thalian 
AB021862 Cucumis melo Cucumis melo mRNA 
F^9058 hevea brasiliensis (para rubber tree). 3-1 
P29057 hevea brasHiensis (para rubt)er tree). 3-1 
P48021 camptotheca acuminata. 3-hydroxy-3-m 
P43256 arabidopsis thaliana (mouse-ear cress). 
P00347 cricetulus griseus (Chinese hamster). 3-1 
L00183 Cricetulus sp. Hamster 3-hydroxy-3-met 
Ml 2705 Mesocricetus auratus Syrian hamster 3* 
P51639 rattus norvegicus (rat). 3-hydroxy-3-nnet 
Q29512 oryctolagus cuniculus (rabbit). 3-hydrox 
Ml 1058 Homo sapiens Human 3-hydroxy-3-met 
M62766 Mus musculus Mouse HMG-CoA reduc 
P2071 5 xenopus laevis (african clawed frog). 3-1 
P16393 strongylocentrotus purpuratus (purple si 
P54960 blattella germanica (german cockroach) 
P14773 drosophila melanogaster (fruit fly). 3-hyc 
P34135 dictyostelium discoideum (slime mold). I 
PI 6237 schistosoma mansoni (blood fluke). 3-hi 
028538 archaeogtobus fulgidus. 3-hydroxy-3-mf 
M24015 Pseudomonas mevalonii P.mevalonii HI 



These sequences, and their truncated counterparts, are 
useful in the present invention. Examples of such preferred 
HMG CoA reductases include the truncated rubber and 
Arabidopsis HMG CoA reductases disclosed herein , 
10 Other enzyme -encoding DNAs can be introduced into 

plants to elevate even further the levels of desirable A5 
sterols and their reduced stanol counterparts as well as 
other phytosterols and tocopherols. Thus, the DNA sec[uences 
contemplated for use in the present invention, which can be 



15 used alone or in various combinations as discussed below. 
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include, but are not limited to, those encoding the 
following enzymes: 3 -hydroxysteroid oxidases; steroid 5- 
reductases; sterol methyltransf erases ; sterol 
acyltransf erases; and S-adenosylmethionine-dependent a- 
5 tocopherol methyltransf erases . 

In each case, the sequences encoding these enzymes can 
comprise an expression cassette comprising, operably linked 
in the 5' to 3 'direction, a seed-specific promoter, the 
enzyme coding sequence, and a transcriptional termination 

10 signal sequence functional in a plant cell such that the 
enzyme is successfully expressed. For use in the. methods 
disclosed herein, the recombinant constructs or expression 
cassettes can be incorporated; in a vector, for example a 
plant expression vector. Such vectors can be transformed 

15 into host cells such as bacterial cells, for example during 
the preparation or modification of the recombinant 
constructs, and plant cells. Thus, the invention 
encompasses plants and seeds comprising such transformed 
plant cells. 

20 It will be apparent to those of skill in the art that 

the nucleic acid sequences set forth herein, either 
explicitly, as in the case of the sequences set forth above, 
or implicitly with respect to nucleic acid sequences 
generally known and not present herein, can be modified due 

25 to the built-in redundancy of the genetic code and 

npncritical areas of the polypeptide that are subject to 
modification and alteration. In this regard, the present 
invention contemplates allelic variants of structural genes 
encoding a polypeptide having HMG-CoA reductase activity. 

30 The previously described DNA segments are noted as 

having a minimal length, as well as total overall length. 
That minimal length defines the length of a DNA segment 
having a sequence that encodes a particular polypeptide 
having HMG-CoA reductase activity. As is well known in the 
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art, as long as the recjuired DNA sequence is present 
(including start and stop signals) , additional base pairs 
can be present at either end of the segment and that segment 
can still be utilised to express the protein. This, of 
5 course, presumes the absence in the segment of an 

operatively linked DNA sequence that represses expression, 
expresses a further product that consumes the enzyme desired 
to be expressed, expresses a product other than the desired 
enzyme or otherwise interferes with the structural gene of 

10 the DNA segment . 

Thus, as long as the DNA segment is free of such 
interfering DNA sequences, a DNA segment of the invention 
can be up to 15,000 base pairs in length. The maximum size 
of a recombinant DNA molecule, particularly a plant 

15 integrating vector, is governed mostly by convenience and 
the vector size that can be accommodated by a host cell, 
once all of the minimal DNA sequences rec[uired for 
replication and expression, when desired, are present. 
Minimal vector sizes are well known. 

20 Also encompassed by the present invention are 

nucleotide sequences biologically functionally equivalent to 
those disclosed herein, that encode conseirvative amino acid 
changes within the amino acid sequences of the presently 
disclosed enzymes, producing "silent" changes therein. Such 

25 nucleotide sequences contain corresponding base 

substitutions based upon the genetic code cotr^ared to the 
nucleotide sequences encoding the presently disclosed 
enzymes . Substitutes for an amino acid within the enzyme 
sequences disclosed herein is selected from other members of 

30 the class to which the naturally occurring amino acid 

belongs. Amino acids can be divided into the following four 
groups: (1) acidic amino acids; (2) basic amino acids; (3) 
neutral polar amino acids; and (4) neutral non-polar amino 
acids. Representative amino acids within these various 
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groups include, but are not limited to: (1) acidic 
(negatively charged) amino acids such as aspartic acid and 
glutamic acid; (2) basic (positively charged) amino acids 
such as arginine, histidine, and lysine; (3) neutral polar 
5 amino acids such as glycine, serine, threonine, cysteine, 
cystine, tyrosine, asparagine, and glutamine; and (4) 
neutral nonpolar (hydrophobic) amino acids such as alanine, 
leucine, isoleucine, valine, proline, phenylalanine, 
tryptophan, and methionine. 

10 

A. HMG-CoA Reductase 

The introduction of an HMG CoA reductase gene into a 
cell results in a higher carbon* throughput through the 
steroid synthesis pathway. The introduction of a truncated 

15 HMG CoA reductase gene (lacking the transmembrane region, 
resulting in a soluible HMG CoA reductase enzyme) provides 
higher HMG CoA reductase activity and thus increased delta- 5 
steroid compound production over the same case with an 
introduced full-length HMG CoA reductase gene. A useful 

20 truncated HMG CoA reductase nucleic acid encodes at least 
the catalytic domain, 

Hydroxymethylglutaryl-CoA reductase is enzyme number 
1.1.1.88, using the recommended nomenclature of the 
International Union of Biochemistry and Molecular Biology on 

25 the Nomenclature and Classification of Enzymes, Enzyme 

Nomenclature 1992 , Edwin C. Webb, ed. , Academic Press, Inc. 
(San Diego, California: 1992), page 35. 

The present invention contemplates transforming a plant 
cell with a structural gene that encodes a polypeptide 

30 having HMG-CoA reductase activity. The HMG-CoA reductase 
enzymes of both animal and yeast cells comprise three 
distinct amino acid residue sequence regions, which regions 
are designated the catalytic region, the membrane -binding 
region and the linker region. 
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The catalytic region contains the active site of the 
HMG-CoA reductase enzyme and comprises about forty percent 
of the COOH-terminal portion of intact HMG-CoA reductase 
enzyme . 

5 The membrane -binding region contains hydrophobic amino 

acid residues and comprises about fifty percent of the NH2- 
terminal portion of intact HMG-CoA reductase enzyme. 

The linker region connects the catalytic and membrane- 
binding regions, and constitutes the remaining about ten 

10 percent of the intact enzyme. 

As discussed in greater detail below, only the 
catalytic region of HMG-CoA reductase is needed herein to 
provide the desired enzyme activity. Thus, an exogenous 
structural gene that encodes a polypeptide corresponding to 

15 that catalytic region is the minimal HMG Co A reductase gene 
required for transforming plant cells in addition to one of 
the steroid pathway enzymes discussed below. The present 
invention therefore contemplates use of both intact and 
truncated structural genes that encode a polypeptide having 

20 HMG-CoA reductase activity. 

A structural gene encoding a polypeptide having HMG-CoA 
reductase activity can be obtained or const rxicted from a 
variety of sources and by a variety of methodologies. See, 
e.g. Carlson et al . , Cell, 28:145 (1982); Rine et al . , Proc. 

25 Natl. Acad. Sci. USA, 80:6750 (1983). Exemplary of such 

structural genes are the mammalian and yeast genes encoding 
HMG-CoA reductase or the catalytic region thereof. 

The disclosures of Chappell, et al . , U.S. Patent No. 
5,349,12 6, are incorporated in full herein by reference. 

30 The mammalian genome contains a single gene encoding HMG-CoA 
reductase. The nucleotide sequence of the hamster and human 
gene for HMG-CoA reductase have been described in Chappell 
et al . A composite nucleotide sequence of DNA corresponds 
to the mRNA SEQ ID NO:l of Chappell et al . , as well as the 
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derived amino acid residue sequence SEQ ID NO: 2 of Chappell 
et al • , for hamster HMG-CoA reductase is provided in Fig. 2 
of Chappell et al , reprinted from Chin et al . , Nature, 
308:613 (1984). The composite nucleotide sequence of Fig. 

5 2, SEQ ID NO:l of Chappell et al . / comprising about 4768 
base pairs, includes the nucleotide sequence encoding the 
intact hamster HMG-CoA reductase enzyme. 

Intact hamster HMG-CoA reductase comprises about 887 
amino acid residues (SEQ ID NO : 2 of Chappell et al . ) . A 

0 structural gene encoding an intact hamster HMG-CoA reductase 
enzyme of 887 amino acid residues comprises base pairs from 
about nucleotide position 164 to about nucleotide position 
2824 of SEQ ID NO : 1 of Chappell et al . 

A preferred structural gene is one that encodes a 

5 polypeptide corresponding to only the catalytic region of 
the enzyme. Two catalytically active segments of hamster 
HMG-CoA reductase have been defined. Liscum et al., J. 
Biol. Chem. , 260(1) :522 (1985). One segment containing a 
catalytic region has an apparent molecular weight of 62 kDa 

0 and comprises amino acid residues from about position 373 to 
about position 887. A second segment containing a catalytic 
region has an apparent molecular weight of 53 kDa segment 
and comprises amino acid residues from about position 460 to 
about position 887. The 62 kDa. catalytically active segment 

5 is encoded by base pairs from about nucleotide position 1280 
to about nucleotide position 2 824 of SEQ ID NO : 1 of Chappell 
et al . The 53 kDa catalytically active segment is encoded 
by base pairs from about nucleotide position 1541 to about 
nucleotide position 2824 of SEQ ID NO:l of Chappell et al . 

0 In a preferred embodiment, the utilized structural gene 

encodes the catalytic region and at least a portion of the 
linker region of HMG-CoA reductase; The linker region of 
hamster HMG-CoA reductase comprises amino acid residues from 
about position 340 to about position 373 or from about 
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position 34 0 to about position 460, depending upon how the 
catalytic region is defined. These linker regions are 
encoded by base pairs from about nucleotide position 1180 to 
about nucleotide position 1283 or from about position 1180 
5 to about position 1540, respectively of SEQ ID NO:l of 
Chappell et al . The structural gene encoding the linker 
region is operatively linked to the structural gene encoding 
the catalytic region. 

In one particularly preferred embodiment, a structural 

10 gene encoding a catalytically active, truncated HMG-CoA 

reductase enzyme can optionally contain base pairs encoding 
a small portion of the membrane region of the enzyme. 

A structural gene encoding a polypeptide comprising a 
catalytically active, truncated or intact HMG-CoA reductase 

15 enzyme from other organisms such as yeast can also be used 
in accordance with the present invention. 

Yeast cells contain two genes encoding HMG-CoA 
reductase. The two yeast genes, designated HMGl and HMG2 , 
encode two distinct forms of HMG-CoA reductase, designated 

20 HMG-CoA reductase 1 SEQ ID NO: 3 of Chappell et al . are 
presented in Fig. 3 of Chappell et al . , are taken from 
Basson et al . Mol. Cell Biol., 8(9):3797 (1988). The 
nucleotide base sequences of HMG2 SEQ ID NO: 5 of Chappell et 
al . as well as the amino acid residue sequence of HMG-CoA 

25 reductase 2 SEQ ID NO: 6 of Chappell et al . are set forth 
therein in the Sequence Listing. 

The entire HMGl gene comprises about 3360 base pairs 
SEQ ID NO: 3 of Chappell et al . Intact HMG-CoA reductase 1 
comprises an amino acid sequence of about 1054 amino acid 

30 residues SEQ ID NO: 4 of Chappell et al . Thus, the minimal 
portion of the HMGl gene that encodes an intact enzyme 
comprises base pairs from about nucleotide position 121 to 
about position 32 82 of Fig. 3, SEQ ID NO: 3 of Chappell et 
al. 
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The entire HMG2 gene comprises about 3348 base pairs 
SEQ ID NO: 5 of Chiappell et al . Intact HMG-CoA reductase 2 
comprises about 104 5 amino acid residues SEQ ID NO: 6 of 
Chappell et al . Thus, the minimal portion of HMG2 gene that 
5 encodes intact HMG-CoA reductase 2 comprises base pairs from 
about nucleotide position 121 to about position 32 55 of SEQ 
ID NO: 5 of Chappell et al . 

By analogy to the truncated hamster structural gene, 
structural genes encoding polypeptides comprising. 
10 catalytically active, truncated HMG-CoA reductase enzymes 
from yeast can also be used in accordance with the present 
invention. 

The catalytic region of HMG-CoA reductase 1 comprises 
amino acid residues from about residue 618 to about reside 

15 1054: i.e., the COOH- terminus . A structural gene that 

encodes the catalytic region comprises base pairs from about 
nucleotide position 1974 to about position 3282 of Fig, 3 of 
Chappell et al . 

The linker region of HMG-CoA reductase 1 comprises an 

20 amino acid sequence from about residue 525 to about residue 
617. A structural gene that encodes the linker region 
comprises nucleotides from about position 1695 to about 
position 1973 of Fig. 3 of Chappell et al . A structural 
gene encoding the linker region of the enzyme operatively 

25 linked to the structural gene encoding the catalytic region 
of the enzyme . 

Also by analogy to the truncated hamster gene, a 
truncated HMGl gene can optionally contain nucleotide base 
pair sequences encoding a small portion of the membrane- 

30 binding region of the enzyme . Such a structural gene 
preferably comprises base pairs from about nucleotide 
position 121 to about position 147 and from about position 
1695 to about position 3282 of Fig. 3 of Chappell et al . 
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A construct similar to those above from an analogous 
portion of yeast HMG-CoA reductase 2 can also be utilized. 

A nucleic acid sequence encoding HMG-CoA reductase from 
Hevea hrasiliensis has been disclosed by Chye et al . (1991) 
5 Plant Mol. Biol. 16: 567-577. A nucleic acid sequence 

encoding an Arahidopsis thaliana HMG-CoA reductase has been 
published by Caelles et al . (1989) Plant Mol. Biol. 13: 627- 
638, and is also available as GenBank accession number 
L19261. U.S. Patents Nos . 5,306,862 and 5,365,017 disclose 
10 additional DNA sequences encoding HMG-CoA reductases. 

The following sequences are listed by Genbank Accession 
numbers : 

026662 methanobacterium thermoautotrophicum. 3-hydroxy- 
3 - met hylglutaryl -coenzyme a reductase (ec 1.1.1.34) (hmg-coa 
15 reductase) . 12/1998 

Q58116 methanococcus jannaschii. 3 -hydroxy- 3- 
methylglutaryl- coenzyme a reductase (ec 1.1.1.34) (hmg-coa 
reductase) . 7/1998 

Q59468 halobacterium volcanii (haloferax volcanii) . 3- 
20 hydroxy- 3 -methylglutaryl- coenzyme a reductase (ec 1 . 1 . 1 . 34) 
(hmg-coa reductase) . 7/1998 

008424 sulfolobus solf ataricus . 3-hydroxy-3- 
methylglutaryl- coenzyme a reductase (ec 1.1.1.34) (hmg-coa 
reductase) . 12/1998 
25 M22255 Saccharomyces cerevisiae Yeast HMG-CoA reductase 

(HGiyi2) gene, complete cds; 3 -hydroxy- 3 -methyl glutaryl 
coenzyme A reductase. 4/1993 

M22002 Saccharomyces cerevisiae Yeast HMG-CoA reductase 
(HGMl) gene, complete cds; 3 -hydroxy- 3 -methyl -glutaryl 
30 coenzyme A reductase. 4/1993 

Q12 64 9 phycomyces blakesleeanus . 3 -hydroxy- 3 - 
methylglutaryl -coenzyme a reductase (ec 1.1.1.34) (hmg- coa 
reductase) (fragment) . 11/1997 
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Q12577 fusariummonili forme (gibberella fujikuroi). 3- 
hydroxy- 3 -methylglutaryl- coenzyme a reductase (ec 1.1.1.34) 
(hmg- coa reductase) . 11/1997 

AB012603 Candida utilis Candida utilis HMG mRNA for 
5 HMG-CoA reductase, complete cds. 7/1998 

P34136 dictyostelium discoideum (slime inold) . 3- 
hydroxy- 3 - me thylglutaryl -coenzyme a reductase 2 (ec 
1.1.1.34) (hmg- coa reductase 2) (f ragment) . 35735 

PQ0761 hydroxymethylglutaryl-CoA reductase (NADPH) (EC 
10 1.1.1.34) (HMGR 10) - wheat (fragment) 

P48019 oryza sativa (rice) . 3 -hydroxy-3 -methylglutaryl- 
coenzyme a reductase (ec 1.1.1.34) (hmg-coa reductase) 
(fragment) . 2/1996 

024594 zea mays (maize) . 3 -hydroxy-3 -me thylglutaryl 
15 coenzyme a reductase (ec 1.1.1.88). 5/1999 

PQ0763 hydroxymethylglutaryl-CoA reductase (NADPH) (EC 
1.1.1.34) (HMGR 23) - wheat (fragment) 

PQ0762 hydroxymethylglutaryl-CoA reductase (NADPH) (EC 
1.1.1.34) (HMGR 18) - wheat (fragment) 
20 from proprietary soy sequence database 

Q00583 hevea brasiliensis (para rubber tree) . 3- 
hydroxy- 3 -me thylglutaryl -coenzyme a reductase 3 (ec 
1.1.1.34) (hmg- coa reductase 3). 7/1998 

Q03163 catharanthus roseus (rosy periwinkle) 
25 (madagascar periwinkle) . 3 -hydroxy-3 -methylglutaryl -coenzyme 
a reductase (ec 1.1.1.34) (hmg-coa reductase). 7/1998 

P48022 lycopersicon esculentum (tomato) . 3-hydroxy-3- 
methylglutaryl -coenzyme a reductase 2 (ec 1.1.1.34) (hmg- 
coa reductase 2) . 7/1998 
30 Q01559 nicotiana sylvestris (wood tobacco) . 3-hydroxy- 

3 -methylglutaryl -coenzyme a reductase (ec 1.1.1.34) (hmg-coa 
reductase) . 7/1998 
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L01400 Solanum tuberosum Potato hydroxymethylglutaryl 
coenzyme A reductase (hmgr) mRNA, complete cds; putative, 
4/1996 

Q43826 raphanus sativus (radish) . 
5 hydroxymethylglutaryl -coa reductase (ec 1.1.1.34) 

(hydroxymethylglutaryl -coa reductase (nadph) ) (3-hydroxy-3- 
methylglutaryl -coenzyme a red 

L19261 Arabidopsis thaliana Arabidopsis thaliana HMG- 
cOA reductase gene, complete cds. 4/1994 
10 AB021862 Cucumis melo Cucumis melo mRNA for HMG-CoA 

reductase, conplete cds; putative. 1/1999 

P2 9058 hevea brasiliensis (para rubber tree) . 3- 
hydroxy- 3 -methylglutaryl- coenzyme a reductase 2 (ec 
1.1.1-34) (hmg- coa reductase 2) (fragment) . 35735 
15 P2 9057 hevea brasiliensis (para rubber tree) . 3- 

hydroxy- 3 -methylglutaryl -coenzyme a reductase 1 (ec 
1.1.1.34) (hmg- coa reductase 1). 7/1998 

P4 8 021 camptotheca acuminata. 3 -hydroxy- 3 - 
methylglutaryl -coenzyme a reductase (ec 1.1.1.34) (hmg-coa 
20 reductase) . 11/1997 

P43256 arabidopsis thaliana (mouse-ear cress) . 3- 
hydroxy- 3 -methylglutaryl -coenzyme a reductase 2 (ec 
1.1.1.34) (hmg- coa reductase 2) (hmgr2) . 7/1998 

P00347 cricetulus griseus (Chinese hamster) . 3-hydroxy- 
25 3 -methylglutaryl -coenzyme A reductase (ec 1.1.1.34) (hmg-coA 
reductase) . 11/1997 

Ij00183 Cricetulus sp. Hamster 3-hydroxy-3- 
methylglutaryl coenzyme A (HMG CoA) reductase gene, exons 19 
and 20; 3 -hydroxy- 3 -methylglutaryl coenzyme A (HMG CoA) . 
30 4/1993 

M12705 Mesocricetus auratus Syrian hamster 3-hydroxy-3- 
methylglutaral coenzyme A reductase (HMG- CoA reductase) 
mRNA, complete cds; 3 -hydroxy-3 -methylglutaral coenzyme A 
red 
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P51639 rattus norvegicus (rat) . 3-hydroxy-3- 
methylglutaryl- coenzyme a reductase (ec 1.1.1.34) (hmg-coa 
reductase) . 12/1998 

Q29512 oryctolagus cuniculus (rabbit) . 3-hydroxy-3- 
5 methylglutaryl -coenzyme a reductase (ec 1.1.1.34) (hmg-coa 
reductase) . 7/1999 

M11058 Homo sapiens Human 3-hydroxy-3-methylglutaryl 
coenzyme A reductase mRNA, complete cds; 3-hydroxy-3- 
methylglutaryl coenzyme A reductase . 11/1994 
10 M62766 Mus musculus Mouse HMG-CoA reductase mRNA, 3» 

end. 4/1993 

P2 0715 xenopus laevis (african clawed frog) . 3-hydroxy- 
3 -methylglutaryl -coenzyme a reductase (ec 1.1.1.34) (hmg-coa 
reductase) . 11/1997 
15 P163 93 strongylocentrotus purpuratus (purple sea 

urchin) . 3 -hydroxy- 3 -methylglutaryl- coenzyme a reductase (ec 
1.1.1.34) (hmg-coa reductase) . 11/1997 

P54 960 blattella germanica (german cockroach) . 3- 
hydroxy- 3 -methylglutaryl- coenzyme a reductase (ec 1.1.1.34) 
20 (hmg-coa reductase) . 11/1997 

P14773 drosophila melanogaster (fruit fly) . 3-hydroxy- 
3 -methylglutaryl -coenzyme a reductase (ec 1.1.1.34) (hmg-coa 
reductase) . 12/1998 

P34135 dictyostelium discoideum (slime mold) . 3- 
25 hydroxy- 3 -methylglutaryl -coenzyme a reductase 1 (ec 
1.1.1.34) (hmg- coa reductase 1) . 11/1997 

P1623 7 schistosoma mansoni (blood fluke) . 3-hydroxy-3- 
methylglutaryl -coenzyme a reductase (ec 1.1.1.34) (hmg-coa 
reductase) . 7/1998 
30 028538 archaeoglobus f ulgidus . 3-hydroxy-3- 

methylglutaryl -coenzyme a reductase (ec 1.1.1.34) (hmg-coa 
reductase) . 12/1998 
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M24 015 Pseudomonas mevalonii P .mevalonii HMG-CoA 
reductase (mvaA) gene, complete cds; HMG-CoA reductase (EC 
1 .1.1.88) . 4/1993 



5 B. Steroid Pathway Enzytnes 

The present invention contemplates nucleic acid 
sequences encoding polypeptides having the enzyme activity 
of the steroid pathway enzymes squalene epoxidase, sterol 
methyl transferase I, sterol C4-demethylase, obtusifoliol 
10 C14a-demethylase, sterol C5-desaturase and sterol methyl 
transferase II. 



i . Squalene Epoxidase 

Squalene epoxidase (also called squalene monooxygenase) 

15 catalyzes the conversion of squalene to squalene epoxide 
(2 , 3-oxidosqualene) , a precursor to the initial sterol 
molecule in phytosterol biosynthetic pathway, cycloartenol . 
This is the first reported step in the pathway where oxygen 
is required for activity. The formation of squalene epoxide 

20 is also the last common reported step in sterol biosynthesis 
of animals, fiingi and plants. Recently, several homologues 
of Arabidopsis and Brassica squalene epoxidase genes were 
reported (Schafer, U.A. , Reed, D.W. , Hunter, D.G., Yao, K. , 
Weninger, A.M., Tsang, E.W., Reaney, M.J., MacKenzie, S.L., 

25 . and Covello, P.S. (1999). Plant Mol . Biol. 39(4): 721-728). 
The same authors also have a PCT application disclosing the 
use of antisense technology with squalene epoxidase to 
elevate squalene levels in plants (WO 97/34003) . However, to 
date there are no reports on functional characterization of 

30 any plant squalene epoxidase gene or enzyme . 

Squalene Epoxidase, also known as squalene 
monooxygenase is enzyme reference number 1 . 14 , 99 , 7 .[jEnzvme 
Nomenclature 1992 , p. 146. 
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Several squalene epoxidase enzymes are knovm to the 
art. These include Arahidopsis squalene epoxidase protein 
sequence Accession No. AC004786 (SEQ ID NO:l), Ar.ah±dops±s 
squalene epoxidase Accession No. N64916 (SEQ ID NO:2) , and 
5 Arabidopsis squalene epoxidase Accession No. T44667 (SEQ ID 
NO: 3) . Japanese patent application No. 07194381 A discloses 
a DNA encoding a mammalian squalene epoxidase. 
\ In order to facilitate the modifications to sterol 

biosynthesis and accumulation described herein, the present 
10 invention also provides an isolated DNA molecule, comprising 
a nucleotide sequence selected from the group consisting of: 

(a) Arabidopsis squalene epoxidase from clone ID 
ATA506263 disclosure SEQ ID NO : 4 , clone ID ATA304243 
disclosure SEQ ID NO : 6 , clone ID ATA102071 disclosure SEQ ID 

15 NO: 8, clone ATA504158 disclosure SEQ ID NO: 10, or the 
complement thereof; 

(b) a nucleotide sequence that hybridizes to said 
nucleotide sequence of (a) under a wash stringency 
equivalent to 0 . 5X SSC to 2X SSC, 0.1% SDS, at 55-65''C, and 

20 which encodes a polypeptide having squalene epoxidase 

enzymatic activity substantially similar to that of the 
disclosed squalene epoxidase; 

(c) a nucleotide sequence encoding the same genetic 
information as said nucleotide sequence of (a) , but which is 

25 degenerate in accordance with the degeneracy of the genetic 
code ; and 

(d) a nucleotide sequence encoding the same genetic 
information as said nucleotide sequence of (b) , but which is 
degenerate in accordance with the degeneracy of the genetic 

30 code . 

An additional aspect of the invention is the 
recombinant constructs and vectors (pMON48343, Fig. 30; 
pMON43844, Fig. 31) comprising nucleic acid sequences 
encoding the novel squalene epoxidase, as well as a method 
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of producing the novel squalene epoxidase, comprising 
culturing a host cell transf oirmed with the novel constructs 
or vectors for a time and under conditions conductive to the 
production of the squalene epoxidase, and recovering the 
5 squalene epoxidase produced thereby. 

ii . Sterol Methyl Transferase I 

S-adenosyl-L-methionine : sterol C-24 methyl transferases 
(SMTl and SMT2) catalyze the transfer of a methyl group from 
10 a cof actor, S-adenosyl-L-methionine, to the C~24 center of 
the sterol side chain (Bach, T.J. and Benveniste, P. (1997), 
Prog. Lipid Res. 36: 197-226). SMT in higher plant cells are 
responsible for their capability to produce a mixture of 24- 
methyl and 24-ethyl sterols (Schaffer, A., Bouvier-Nave, 
15 Benveniste, P., Schaller, H. (2000) Lipids 35: 263-269). 
Functional characterization of the SMT using a yeast Grg6 
expression system demonstrated unambiguously that an SMTl 
sequence encodes a cycloartenol -C24 -methyltransf erase and a 
SMT2 sequence encodes a 24 -methylene lophenol-C24- 
20 methyl transferase in a given plant species (Bouvier-Nave, 
P., Husselstein, T. , and_Benveniste, P. (1998), Eur. J. 
Biochem- 246: 518-529), Several plant genes coding for SMTl 
and SMT2 have been reported and reviewed (Schaffer, A. , 
Bouvier-Nave, Benveniste, P., Schaller, H. (2000) Lipids 35: 
25 263-269) . Transgenic plants expressing homologues of either 
SMTl or SMT2 have been studied (Schaffer, A., Bouvier-Nave, 
Benveniste, P,, Schaller, H. (2000) Lipids 35: 263-269). The 
use of these genes to modify plant sterol composition are 
also covered by two Monsanto patent applications (WO 
30 98/45457 and WO 00/61771) . 

Sterol methyl transferase I enzymes known in the art 
are useful in the present invention. Examplary sec[uences 
include the known AraJbidopsis sterol methyl transferase I 
protein sequence Accession No. U71400 (disclosure SEQ ID 
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NO: 19), the known tobacco sterol methyl transferase I 
protein sequence Accession No, U81312 (disclosure SEQ ID 
NO: 20) and Ricinus coimiunis sterol-C-methyltrans^erase, Eur. 
J. BiochGui., 246(2), 518-529 (1997). (Complete cds, 
5 Accession No. g2246457) , 

S-Adenosyl -L-MethioninB- Sterol -C24 -Methyl transferase- -A 
1 nucleic acid sequence encoding an Arahidopsis thaliana S- 

\ adenosyl-L-methionine-sterol-C24 -methyl transferase has been 

published by Husselstein et al . (1996) FEBS Letters 381: 87- 
10 92. A^'^-sterol C-methyltransf erase is enzyme number 
2.1.1.41, Enzyme Nomenclature 1992 , page 160. 



iii . Sterol C4 -Demethvlase 

15 Sterol 0-4 demethylase catalyses the first of several 

demethylation reactions, which results in the removal of the 
two methyl groups at C-4 . While in animals and fungi the 
removal of the two C-4 methyl groups occurs consecutively, 
in plants it has been reported that there are other steps 

20 between the first and second C-4 demethylations (Bach, T.J. 
and Benveniste, P. (1997), Prog. Lipid Res. 36: 197-226). 
The C-4 demethylation is catalyzed by a complex of 
microsomal enzymes consisting of a monooxygenase , an NAD*- 
dependent sterol 4 -decarboxylase and an NADPH- dependent 3- 

25 ketosteroid reductase . 



iv. Qbtusif oliol C14a-Demethvlase 

Sterol C-14 demethylase catalyzes demethylation at C~14 
which removes the methyl group at C-14 and creates a double 
30 bond at that position. In both, fungi and animals, this is 

the first step in the sterol synthesis pathway. However, in 
higher plants, the 14a-methyl is removed after one C-4 
methyl has disappeared. Thus, while lanosterol is the 
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substrate for C-14 demethylase in animal and fungal cells, 
the plants enzyme uses obtusifoliol as substrate. Sterol 14- 
demethylation is mediated by a cytochrome P-450 complex. The 
mechanism of 14a-methyl removal involves two oxidation steps 
5 leading to an alcohol, then an aldehyde at C-29 and a 

further oxidative step involving a def oxmylation leading to 
formic acid and the sterol product with a typical 8,14-diene 
(Aoyama, Y., Yoshida, Y. , Sonoda, Y., and Sato, Y. (1989) J. 
Biol. Chem. 264: 18502-18505). Obtusifoliol 14a-demethylase 

10 from Sorghum hicolor (L) Moench has been cloned using a 
gene- specif ic probe generated using PGR primers designed 
from an internal 14 amino acid sequence and was functionally 
expressed in E. coli (Bak, S, Kahn, R.A. , Olsen, C.E. and 
Halkier, B.A. (1997) The Plant Journal 11(2): 191-201). 

15 Also, Sa.ccha.romyces cerevisiae CYP51A1 encoding lanosterol- 
14 -demethylase was functionally expressed in tobacco 
(Grausem, B., Ghaubet, N. , Gigot, C., Loper, J.G., and 
Benveniste, P. (1995) The Plant Journal 7(5): 761-770). 

Sterol G-14 demethylase enzymes and sec[uences are known 

20 in the art. For example Sorghum bicolor obtusifoliol 14a- 

demethylase GYP51 mRNA, described in Plant J,, 11(2): 191-201 
(1997) (complete cds Acession No. U74319) . In order to 
facilitate the modifications to sterol biosynthesis and 
accumulation described herein, the present invention also 

25 provides an isolated DNA molecule, having a nucleotide 
sequence selected from the group consisting of: 

(a) obtusifoliol C14a-demethylase from clone ID: 
ATA101105 disclosure SEQ ID NO; 14, clone ID ATA202967 
disclosure SEQ ID NO: 15, clone ID ATA403931 disclosure SEQ 

30 ID NO: 17, or the complement thereof; 

(b) a nucleotide sequence that hybridizes to said 
nucleotide sequence of (a) under a wash stringency 
equivalent to 0 . 5X SSC to 2X SSG, 0.1% SDS, at 55-€5°G, and 
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which encodes a polypeptide having obtusifoliol C14a- 
demethylase enzymatic activity substantially similar to that 
of the disclosed obtusifoliol C14a-demethylase; 

(c) a nucleotide sequence encoding the same genetic 

5 information as said nucleotide sequence of (a) , but which is 
degenerate in accordance with the degeneracy of the genetic 
code ; and 

(d) a nucleotide sequence encoding the same genetic 
information as said nucleotide sequence of (b) , but which is 

10 degenerate in accordance with the degeneracy of the genetic 
code . 

An additional aspect of the invention is the 
recombinant constructs and vectors (pMON43 842, Fig. 29) 
comprising nucleic acid sequences encoding the novel 

15 obtusifoliol C14a-demethylase, as well as a method of 
producing the novel obtusifoliol C14a-demethylase, 
comprising culturing a host cell transformed with the novel 
constructs or vectors for a time and under conditions 
conductive to the production of the obtusifoliol C14a- 

20 demethylase, and recovering the obtusifoliol C14a- 
demethylase produced thereby. 

V. Sterol C5-Desaturase 

Sterol C-5 desaturase catalyzes the insertion of the A^- 
25 double bond that normally occurs at the A''-sterol level, 

thereby forming a A^'''-sterol (Parks et al . , Lipids 30:227- 
23 0 (1995) ) . The reaction has been reported to involve the 
stereospecif ic removal of the 5a and 6a hydrogen atoms, 
biosynthetically derived from the 4 pro-R and 5 pro-S 
30 hydrogens of the (+) and (-) R-mevalonic acid, respectively 
(Goodwin, T.W. (1979) Annu. Rev. Plant Physiol. 30: 369- 
4 04) . The reaction is obligatorily aerobic and requires 
NADPH or NADH. The desaturase has been reported to be a 
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multienzyme complex present in microsomes. It consists of 
the desaturase itself, cytochrome bg and a pyridine 
nucleotide-dependent f lavoprotein . The A^-desaturase is 
reported to be a mono -oxygenase that utilizes electrons 
5 derived from a reduced pyridine nucleotide via cytochrome^ 
(Taton, M. , and Rahier, A. (1996) Arch. Biochem. .Biophys. 
325: 279-288). An Arabidopsis thaliana cDNA encoding a 
sterol -C5 -desaturase was cloned by functional 
complementation of a yeast mutant, ergr3 defective in ERG3, 

10 the gene encoding the sterol C5 -desaturase required for 

ergosterol biosynthesis (Gachotte D., Husselstein, T. , Bard, 
M., Lacroute F., and Benveniste, P. (1996) The Plant Journal 
9(3): 391-398). Known sterol C5-desaturase enzymes are 
useful in the present invention, including Arabidopsis 

15 sterol C5-desaturase protein sequence Accession No. X90454, 
disclosure SEQ ID NO: 22, and the Arabidopsis thaliana mRNA 
for sterol-C5-desaturase described in Plant J". 9(3):391-398 
(1996) (complete cds Accession No. gl061037) . 

The NCBI (National Center for Biotechnology 

20 Information) database shows 37 sequences for sterol 

desaturase that are useful in the present invention. The 
following are exemplary of such sequences. From yeast: C5 
sterol desaturase NP_013157 (Saccharomyces cerevisiae) ; 
hypothetical C5 sterol desaturase-f ission T40027 

25 (Schizosaccharomyces porabe) ; C5 sterol desaturase-f ission 
T37759 (Schizosaccharomyces porabe) ; C5 sterol desaturase 
tJQ114 6 (Saccharomyces cerevisiae) ; C5 sterol desaturase 
BAA21457 (schizosaccharomyces pombe) ; C5 sterol desaturase 
CAA22 610 (Schizosaccharomyces pombe) ; putative C5 sterol 

30 desaturase CAA16 898 (Schizosaccharomyces pombe) ; probable C5 
sterol desaturase 013 666 (erg3 _schpo) ; C5 sterol desaturase 
P50860 (Erg3_canga) ; C5 sterol desaturase P32353 
(erg3_yeast) ; C5,6 desaturase AAC99343 (Candida albicans); 
C5 sterol desaturase BAA202 92 (Saccharomyces cerevisiae) ; C5 
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sterol desaturase AAB39844 (Saccharomyces cerevisiae) ; C5 
sterol desaturase AAB29844 (Saccharomyces cerevisiae) ; C5 
sterol desaturase CAA64303 (Saccharomyces cerevisiae) / C5 
sterol desaturase AAA34595 (Saccharomyces cerevisiae) ; C5 
5 sterol desaturase AAA34594 (Saccharomyces cerevisiae) . From 
plants: C5 sterol desaturase S71251 (Arabidopsis thaliana) ; 
i putative sterol -C5-desaturase AAF32466 (Arabidopsis 

\ thaliana) ; sterol -C5-desaturase AAF32465 (Arabidopsis 

thaliana) ; putatuve sterol desaturase AAF22 921 (Arabidopsis 
10 thaliana) ; delta? sterol C5 desaturase (Arabidopsis 
thaliana); sterol C5(6) desaturase homolog AAD20458 
(Nicotiana tabacum) ; sterol C5 desaturase AAD12944 
(Arabidopsis thaliana); sterol C5 , 6 desaturase AAD04034 
(Nicotiana tabacum) ; sterol C5 desaturase CAA62079 
15 (Arabidopsis thaliana). From mammals: sterol-C5-desaturase 
(Mus musculus) BAA33730; sterol -C5 -desaturase BAA33729 (Homo 
sapiens) ; lathosterol oxidase CAB65928 (Leishmania major) ; 
lathosterol oxidase (lathosterol 5-desaturase) 088822 (Mus 
musculus) ; lathosterol 5-desaturase 075845 (Homo sapiens) ; 
, 20 delta? sterol C5 desaturase AAF00544 (Homo sapiens) . 

Others: fungal sterol C5 desaturase homolog BAA18970 (Homo 
sapiens) . 

For DNA sequences encoding a sterol-C5-desaturase 
useful in the present invention, the NCBI_nucleotide search 

25 for "sterol desaturase" came up with 110 sequences. The 
following are exemplary of such sequences. NC_001139 
(Saccharomyces cerevisiae) ; NC_001145 (Saccharomyces 
cerevisiae) ; NC_001144 (Saccharomyces cerevisiae) ; AW7 00015 
(Physcomitrella patens) ; AB004539 (Schizosaccharomyces 

30 pombe) ; and AW596303 (Glycine max) ; AC012188 (Arabidopsis 
thaliana) . 
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vi . Sterol Methvl Transferase II 

The combination of introduction of an HMG-CoA reductase 
gene along with a sterol methyl transferase II gene into a 
cell ser*ves to reduce steroid pathway intermediate compound 
5 accumulation in addition to reducing the accumulation of 24- 
methyl sterols such as campesterol . 

Known sterol methyl transferase II enzymes are useful 
in the present invention, including Arabidopsis sterol 
methyl transferase II protein sequence (complete raRNA cds 
10 from FEES Lett. 381(12) : 87-92 (1996) Accession No. X89867) , 
disclosure SEQ ID NO: 21. 

Recombinant constructs encoding any of the forgoing 
enzymes affecting the steroid biosynthetic pathway can be 
incorporated into recombinant vectors comprising the 
15 recombinant constructs comprising the isolated DNA 
molecules. Such vectors can be bacterial or plant 
expression vectors. 

IV. Recombinant Constructs and Vectors 
20 The present invention contemplates a recombinant 

construct that contains a DNA sequence encoding a 
polypeptide exhibiting 3 -hydroxy-3-methylf luaryl-Coenzyme A 
(HMG-CoA) reductase activity and a DNA sequence encoding a 
polypeptide exhibiting the activity of a steroid pathway 
25 enzyme. Each polypeptide -encoding DNA secfuence is operably 
linked in the 5 ' to 3 ' direction independent of the other 
sequence. Each DNA sequence in the 5' to 3 ' direction 
comprises a promoter, then the DNA secjuence encoding the 
polypeptide then a transcription termination signal 
30 sequence. The steroid pathway enzyme is a squalene 

epoxidase enzyme, a sterol methyl transferase I enzyme, a 
sterol C4-demethylase enzyme, a obtusifoliol C14a- 
demethylase enzyme, a sterol C5-desaturase enzyme, or a 
sterol methyl transferase II enzyme. 
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Preferably, the promoters in the recombinant construpt 
are seed-specific promoters. In one embodiment, the 
promoter is derived from a species in a different order from 
the host cell- In other embodiments, the encoded HMG CoA 
5 reductase and/or steroid pathway enzymes is (are) from a 

species in a different order from the order that of the host 
cell . 

It is contemplated that a construct comprises more than 
one of the DNA sequences encoding a steroid pathway enzyme. 

10 The invention also contemplates a recombinant vector 

comprising the above-described recombinant construct, 
wherein that vector is preferably a plant expression vector. 

A recombinant DNA molecule of the present invention can 
be produced by operatively linking a vector to a useful DNA 

15 segment discussed herein to form a plasmid. A vector 

capable of directing the expression of a polypeptide having 
HMG-CoA reductase activity is referred to herein as an HMG- 
CoA reductase "plant integrating vector" . 

Such plant integrating vectors contain control elements 

20 that direct and regulate expression, including a promoter, a 
marker, a terminator and insertion sequence (e.g. FIG. 5) . 
The polypeptide coding genes are operatively linked to the 
plant integrating vector to allow the promoter sequence to 
direct RNA polymerase binding and expression of the desired 

25 polypeptide coding gene. 

Useful in expressing the polypeptide coding gene are 
promoters that are inducible, viral, synthetic, constitutive 
as described by Poszkowski et al . , EMBO J., 3:2719 (1989) 
and Odell et al . , Nature, 313:810 (1985), and temporally 

30 regulated, spatially regulated and spatiotemporally 

regulated as given in Chau et al . , Science, 244:174-181 
(1989) . The promoter preferably comprises a promoter 
sequence whose function in regulating expression of the 
structural gene is substantially unaffected by the amount of 
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sterol or squalene in the cell . As used herein, the term 
^'substantially unaffected" means that the promoter is not 
responsive to direct feedback control by the sterols or 
squalene accumulated in transformed cells or transgenic 
5 plants- 

A promoter is also selected for its ability to direct 
the transformed plant cell's or transgenic plant's 
transcriptional activity to the structural gene encoding a 
polypeptide having HMG-CoA reductase activity. Structural 

10 genes can be driven by a variety of promoters in plant 

tissues. Promoters can be near-constitutive, such as the 
CaMV 3 5S promoter, or tissue specific or development ally 
specific promoters affecting dicots or monocots. 

As exemplified and discussed in detail herein, where 

15 the near-constitutive promoter CaMV 35S is used to transform 
tobacco plants, increases in total sterol and squalene 
accumulation are found in a variety of transformed plant 
tissues (e.g. callus, leaf, seed and root). Alternatively, 
the effects of transformation (e.g. increased amount of a 

20 gene coding for HMG-CoA reductase, increased total sterol 
accumulation and increased squalene accumulation) can be 
directed to specific plant tissues by using plant 
integrating vectors containing a tissue-specific promoter. 
An exemplary tissue- specific promoter is the Lectin 

25 promoter, which is specific for seed tissue. The Lectin 
protein in soybean seeds is encoded by a single gene (Lei) 
that is only expressed during seed maturation and accounts 
for about 2 to about 5 percent of total seed mRNA. The 
Lectin gene and seed- specif ic promoter have been fully 

30 characterized and used to direct seed specific expression in 
transgenic tobacco plants. See, e.g., Vodkin et al . , Cell, 
34:1023 (1983) and Lindstrom et al . , Developmental Genetics, 
11:160 (1990) . 
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A plant integrating vector containing a structural gene 
coding for a polypeptide having HMG-CoA reductase activity 
is engineered to be under control of the Lectin promoter and 
that vector is introduced into soybean plants using a 
5 protoplast transformation method. E.G. Dhir et al . , Plaint 
Cell Reports, 10:97 (1991). The expression of the 
polypeptide having HMG-CoA reductase activity is directed 
specifically to the seeds of the transgenic plant. In this 
way, a transgenic soybean seed having increased squalene 

10 accumulation is produced. Such seeds can then be used to 
extract oil containing enhanced levels of squalene. As set 
forth hereinafter, such squalene -enhanced oil is 
characterized by a greater thermal stability when compared 
to non- squalene -enhanced oil. 

15 In the present invention, a plant has an exogenously 

provided structural gene for HMG-CoA reductase and at least 
one of the six enumerated steroid pathway enzymes, a 
squalene epoxidase enzyme, a sterol methyl transferase I 
enzyme, a sterol C4-demethylase enzyme, a obtusifoliol C14a- 

20 demethylase enzyme, a sterol C5-desaturase enzyme, or a 

sterol methyl transferase II enzyme. The plant or seed thus 
containing these added genes is contemplated, while the 
methods to arrive at a plant or seed according to the 
invention are open to the multitude of methods contemplated 

25 by a person of ordinary skill in the art. In particular, 
all of the added structural genes do not have to have been 
added at the same time, or by the same route. Thus, for 
example, the HMG-CoA reductase activity may result from a 
cross with a plant made according to a process of U.S. 

30 Patent No. 5,349,126, while a steroid pathway enzyme is 

added by nucleic acid bombardment to that plant. Further, 
when more than one nucleotide sequence encoding a steroid 
pathway enzyme is present in a contemplated plant, the 
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expression of the gene does not have to be under the control 
of the same promoter, or even the same type of promoter. 

The choice of which plant integrating vector and 
ultimately to which promoter a polypeptide coding gene is 
5 operatively linked depends directly on the functional 

properties desired, e.g. the location and timing of protein 
expression, and the host cell to be transformed. These are 
well known limitations inherent in the art of constructing 
recombinant DNA molecules. However, a vector useful in 

10 practicing the present invention is capable of directing the 
expression of the polypeptide coding gene, i.e., the gene 
encoding HMG-CoA reductase activity, included in the DNA 
segment to which it is operatively linked. 

Typical vectors useful for expression of genes in 

15 higher plants are well known in the art and include vectors 
derived from the tumor- inducing (Ti) plasmid of 
Agjrobacterium tumefaciens, described by Rogers et al . , Meth. 
Enzymol,, 153:253-277 (1987). However, several other plant 
integrating vector systems are known to function in plants 

20 including pCaMVCN transfer control vector described by Fromm 
et al . Proc. i^at. Acad. Sci. USA, 82:5824 (1985). Plasmid 
pCaMVCN (available from Pharmacia, Piscataway, N.J.) 
includes the cauliflower mosaic virus CaMV 35S promoter. 

The use of retroviral plant integrating vectors to form 

25 the recombinant DNAs of the present invention is also 

contemplated. As used herein, the term ^^retroviral plant 
integrating vector" refers to a DNA molecule that includes a 
promoter sequence derived from the long teirminal repeat 
(LTR) region of a retrovirus genome . 

30 In preferred embodiments, the vector used to express 

the polypeptide coding gene includes a selection marker that 
is effective in a plant cell, preferably a drug resistance 
selection marker. One preferred drug resistance marker is 
the gene whose expression results in kanamycin resistance; 
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i.e., the chimeric gene containing the nopaline synthase 
promoter, Tn5 neomycin phosphotransferase II and nopaline 
synthase 3' nontranslated region described by Rogers et al . , 
in Methods for Plant Molecular Biology, A. Weissbach and H. 
5 Weissbach eds.. Academic Press Inc., San Diego, Calif. 
(1988) . 

i 

■ A variety of methods have been developed to operatively 

\ 

\ link DNA to vectors via complementary cohesive termini or 

blunt ends. For instance, complementary homopolymer tracts 

10 can be added to the DNA segment to be inserted and to the 
vector DNA. The vector and DNA segment are then joined by 
hydrogen bonding between the complementary homopolymer ic 
tails to form recombinant DNA molecules. 

Alternatively, synthetic linkers containing one or more 

15 restriction endonuclease sites can be used to join the DNA 
segment to the plant integrating vector. The synthetic 
linkers are attached to blunt -ended DNA segments by 
incubating the blunt -ended DNA segments with a large excess 
of synthetic linker molecules in the presence of an enzyme 

20 that' is able to catalyze the ligation of blunt-ended DNA 
molecules, such as bacteriophage T4 DNA ligase. Thus, the 
products of the reaction are DNA segments carrying[ synthetic 
linker sequences at their ends. These DNA segments are then 
cleaved with the appropriate restriction endonuclease and 

25 ligated into a plant integrating vector that has been 

cleaved with an enzyme that produces termini compatible with 
those of the synthetic linker. Synthetic linkers containing 
a variety of restriction endonuclease sites are commercially 
available from a number of sources including New England 

30 BioLabs, Beverly, Mass. 

Also contemplated by the present invention are RNA 
equivalents of the above -described recombinant DNA 
molecules . 
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A preferred recombinant DNA molecules utilized in 
accordance with the present invention are pMON53733- 
pMON53740 (Figures 13-20) . 



5 A. Promoters and Target Tissues 

Promoters useful in the present invention include those 
that confer appropriate cellular and temporal specificity of 
expression. Such promoters include those that are 
constitutive or inducible, environmentally- or 

10 developmentally-regulated, or organelle-, cell-, or tissue- 
specific. Preferred promoters for use with the present 
invention promote expression of the introduced enzymes in 
the seed in the cytosol, although expression in plasids or 
organelles of the seeds is also contemplated . 

15 Often-used constitutive promoters include the CaMV 3 5S 

promoter (Odell et al . (1985) Nature 313: 810), the enhanced 
CaMV 3 5S promoter, the Figwort Mosaic Virus (FMV) promoter 
(Richins et al . (1987) NAR 20: 8451), the mannopine synthase 
(mas) promoter, the nopaline synthase (nas) promoter, and 

20 the octopine synthase (ocs) promoter. 

Useful inducible promoters include heat -shock promoters 
(Ou-Lee et al . (1986) Proc. Natl. Acad. Sci. USA 83: 6815; 
Ainley et al . (1990) Plant Mai. Biol. 14: 949), a nitrate- 
inducible promoter derived from the spinach nitrite 

25 reductase gene (Back et al . (1991) Plant Mol . Biol. 17: 9), 
hormone -inducible promoters (Yamaguchi-Shinozaki et al . 
(1990) Plant Mol. Biol. 15: 905; Kares et al . (1990) Plant 
Afol. Biol. 15: 905), and light -inducible promoters 
associated with the small subunit of RuBP carboxylase and 

30 IjHCP gene families (Kuhlemeier et al . (1989) Plant Cell 1: 
471; Feinbaum et al . (1991) Mol. Gen. Genet. 226: 449; 
Weisshaar et al . (1991) EMBO iJ. 10: 1777; Lam and Chua 
(1990) Science 248: 471; Castresana et al . (1988) EMBO J. 7: 
1929; Schulse-Lefert et al . (1989) EMBO J. 8: 651). 
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Examples of useful tissue-specific, developmental ly- 
regulated promoters include fruit-specific promoters such as 
the E4 promoter (Cordes et al . (1989) Plant Cell . 1 : 1025) , 
the E8 promoter (Deikman et al , (1988) EMBO J. 7: 3315), the 
5 kiwifruit actinidin promoter (Lin et al . (1993) PNAS 90: 
5939), the 2A11 promoter (Houck et al . , U.S.. Patent 
4,943,674), and the tomato pZ130 promoter (U.S. Patents 
5,175, 095 and 5,530,185); the p-conglycinin 7S promoter 
(Doyle et al . (1986) J. Biol. Chem. 261: 9228; Slighton and 
10 Beachy (1987) PJanta 172: 356), and seed-specific promoters 
(Knutzon et al . (1992) Proc. Natl. Acad. Sci . USA 89: 2624; 
Bustos et al . (1991) EMBO J. 10: 1469; Lam and Chua (1991) 
J. Biol. Chem. 266: 17131; Stayton et al . (1991) Aust. J. 
Plant. Physiol. 18: 507). Fruit-specific gene regulation is 
15 discussed in U.S. Patent 5,753,475. Other useful seed- 
specific promoters include, but are not limited to, the 
napin, phaseolin, zein, soybean trypsin inhibitor, 7S, 
ADR12, ACP, stearoyl-ACP desaturase, oleosin, Lasguerella 
hydroxylase, and barley aldose reductase promoters (Bartels 
20 (19 95) Plant J". 7: 809-822), the EA9 promoter (U.S. Patent 
5,420,034), and the Bce4 promoter (U.S. Patent 5,530,194). 
Useful embryo- specific promoters include the corn globulin 1 
and oleosin promoters. Useful endosperm-specific promoters 
include the rice glutelin-1 promoter, the promoters for the 
25 low-pl -amylase gene (Amy32b) (Rogers et al . (1984) J*. 

Biol. Chem. 259: 12234), the high-pl -amylase gene (Amy 64) 
(Khurseed et al - (1988) J". Biol. Chem. 263: 18953), and the 
promoter for a barley thiol protease gene ("Aleurain") 
(Whittier et al . (1987) Nucleic Acids Res. 15: 2515). 
30 Appropriate target tissues of plants for enhanced 

production of sterol compounds such as sitosterol, 
sitosterol esters, sitostanol, sitostanol esters, and 
tocopherols, and reduced production of campesterol, 
campestanol, and esters thereof, include, but are not 
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limited to, fruits, flowers, seeds, roots, tubers, leaves, 
stems, buds, and other vegetable parts of plants. Within 
seeds, appropriate organ compartments include the embryo, 
the endosperm, and the aleurone layer. Within any of the 
5 noted target tissues, appropriate cellular compartments 
include, but are not limited to, the cell cytoplasm and 
plastids (e.g., proplastids, chloroplasts, chromoplasts, 
leucoplasts, amyloplasts, etc.). 

10 B. Vectors 

In plants, transformation vectors capable of 
introducing encoding DNAs involved in sterol compound and 
tocopherol biosynthesis are easily designed, and generally 
contain one or more DNA coding sequences of interest under 

15 the transcriptional control of 5 ' and 3 ' regulatory 

sequences. Such vectors generally comprise, operatively 
linked in sequence in the 5' to 3 ' direction, a promoter 
sequence that directs the transcription of a downstream 
heterologous structural DNA in a plant; optionally, a 5' 

20 non-translated leader sequence; a nucleotide sequence that 
encodes a protein of interest; and a 3' non- translated 
region that encodes a polyadenylation signal which functions 
in plant cells to cause the termination of transcription and 
the addition of polyadenylate nucleotides to the 3' end of 

25 the mRNA encoding the protein. Plant transformation vectors 
also generally contain a selectable marker. Typical 5 '-3' 
regulatory sequences include a transcription initiation 
start site, a ribosome binding site, an RNA processing 
signal, a transcription termination site, and/or a 

30 polyadenylation signal. Vectors for plant transformation 
have been reviewed in Rodriguez et al . (1988) Vectors: A 
Survey of Molecular Cloning Vectors and Their Uses, 
Butterworths, Boston; Click et al . (1993) Methods in Plant 
Molecular Biology and Biotechnology CRC Press, Boca Raton, 
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Fla; and Croy (1993) In Plant Moleculajr Biology LahfcUK, 
Hames and Rickwood, Eds./ BIOS Scientific Publishers 
Limited, Oxford, UK. 

The use of transit peptides, e.g. translational fusion 
5 peptides, are not preferred for use in conjunction with the 
enzymes of the present invention, where the sterol 
synthethic compounds are present primarily in the cellular 
cytosol . 

10 V. Cell Transformation and Plant Regeneration 

The amount of a gene coding for a polypeptide having 
HMG-CoA reductase activity is increased by transforming a 
desired plant cell with a suitable vector that contains that 
added exogenous structural gene. Expression of that gene in 

15 the transformed plant cell and transgenic plants developed 
from that transformed plant cell enhances the activity of 
HMG-CoA reductase.. 

Methods for transforming polypeptide -coding genes into 
plant cells include Asrrojbacterium-mediated plant 

20 transformation, protoplast transformation, gene transfer 
into pollen, injection into reproductive organs and 
injection into immature embryos. Each of these methods has 
distinct advantages and disadvantages. Thus, one particular 
methods of introducing genes into a particular plant strain 

25 may not necessarily be the most effective for another plant 
strain, but it is well known which methods are useful for a 
particular plant strain. 

Agrrojbacterium-mediated transfer is a widely applicable 
system for introducing genes into plant cells because the 

30 DNA can be introduced into whole plant tissues, thereby 

bypassing the need for regeneration of an intact plant from 
a protoplast. The use of Agrirobacteriu/n- mediated plant 
integrating vectors to introduce DNA into plant cells is 
well known in the art. See, for example > the methods 
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described by Fraley et al . , Biotechnology, 3:629 (1984) and 
Rogers et al . , Methods in Enzymology, 153:253-277 (1987). 
Further the integration of the T8-DNA is a relatively 
precise process resulting in few rearrangements. The region 
5 of DNA to be transferred is defined by the border sequences, 
and intervening DNA is usually inserted into the plant 
genome as described by Spielmann et al . , Mol • Gen. Genet., 
205:34 (1986) and Jorgensen et al, Mol. Gen. Genet., 207:471 
(1987) . 

10 Modern Agrrobacterium transformation vectors are capable 

of replication in E. coli as well as Agrrobacterium, allowing 
for convenient manipulations as described by Klee et al . , in 
Plant DNA Infectious Agents, T. Hohn and J. Schell, eds . , 
Springer- Verlag, New York (1985) pp. 179-203. 

15 Moreover, recent technological advances in vectors for 

Agrrobacterium-mediated gene transfer have improved the 
arrangement of genes and restriction sites in the vectors to 
facilitate construction of vectors capable of expressing 
various polypeptide coding genes. The vectors described by 

20 Rogers et al . , Meth. Enzymol . , 153:253 (1987), have 

convenient multi-linker regions flanked by a promoter and a 
polyadenylation site for directed expression inserted 
polypeptide conding genes and are suitable for present 
purposes . 

25 In addition, Agrobacteria. containing both armed and 

disarmed Ti genes can be used for the transformations. Both 
types of transf orroing systems are illustrated herein. 
Transf ormants from the former system result in callus from 
which the desired squalene or sterol can be obtained, 

30 whereas transf ormants obtained from the latter, disarmed Ti 
genes can be regenerated into complete transgenic plants 
from whose tissues, e.g. leaf, seed and root, the desired 
chemicals can be obtained. 
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In those plant strains where Agrroibacte^rium-mediated 
transformation is efficient, it is the method of choice 
because of the facile and defined nature of the gene 
transfer . 

5 Agrirojbacterium-mediated transformation of leaf disks and 

other tissues, such as cotyledons and hypocotyls, appears to 
be limited to plant strains that Agrobacaterium naturally 
infects. AgrroJbactejriiiin-mediated trans forroat ion is the most 
efficient in dicotyledonous plants. Few monocots appear to 

10 be natural hosts for Agrr-oJbacteriuin, although transgenic 

plants have been produced in the monocot , asparagus, using 
Agrob^cterium vectors as described by Bytebier et al - , Proc. 
Natl. Ac^d. Sci. USA, 84:5345 (1987). Therefore, 
commercially important cereal grains such as rice, corn, and 

15 wheat must usually be transformed using alternative methods. 
However, as mentioned above, the transformation of asparagus 
using AgrobcLCteriuim can also be achieved. 

A transgenic plant formed using Agjrojbacterium 
transformation methods typically contains a single gene on 

20 one chromosome. Such transgenic plants can be referred to 
as being heterozygous for the added gene. However, inasmuch 
as use of the word "heterozygous" usually implies the 
presence of a complementary gene at the same locus of the 
second chromosome of a pair of chromosomes, and there is no 

25 such gene in a plant containing one more than one added gene 
as here, it is believed that a more accurate name for such a 
plant is an independent segregant, because the added, 
exogenous gene segregates independently during mitosis and 
meiosis. A transgenic plant containing a single structural 

30 gene that encodes a polypeptide having HMG-CoA reductase 
activity and at least one of the enumerated 6 steroid 
pathway enzymes; i.e., and independent segregant, is a 
preferred transgenic plant . 
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More preferred is a transgenic plant that is homozygous 
for the added structural gene; i.e., a transgenic plant that 
contains two added genes, one gene at the same locus on each 
chromosome of a chromosome pair. A homozygous transgenic 
5 plant can be obtained by sexually mating (selfing) an 

independent segregant transgenic plant that contains the 
added genes according to the invention, germinating some of 
the seed produced and analyzing the resulting plants 
produced for enhanced HMG-CoA reductase activity, steroid 
10 pathway product accumulation or both, relative to a control 
(native, non- transgenic) or an independent segregant 
transgenic plant . 

A homozygous transgenic plant exhibits enhanced HMG-GoA 
reductase activity as compared to a native, non- transgenic 
15 plant and an independent segregant transgenic plant . 

It is to be understood that two different transgenic 
plants can also be mated to produce offspring that contain 
two independently segregating added, exogenous genes. 
Selfing of appropriate progeny can produce plants that are 
20 homozygous for both added, exogenous genes that encode a 
polypeptide having HMG-CoA reductase activity. Back- 
crossing to a parental plant and outcrossing with a non- 
transgenic plant are also contemplated. 

25 A. Host Cells and Transformed Plant Cells 

Cells modified according to the present invention are 
contemplated at each stage of the jprocesses of the 
invention. As a result of the invention comprising at least 
two genes, there are several means to accomplish that end. 

30 In some embodiments of the invention, the intermediate steps 
include transformation of nucleic acids comprising some or 
all of the genes into host cells. 

The nucleic acid sequence encoding a polypeptide 
exhibiting HMGR activity does not have to be in the same 
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orientation as a nucleic acid sequence encoding a 
polypeptide exhibiting the activity of a steroid pathway 
enzyme . The coding nucleic acids can be under the control 
of different promoters or be in different orientations. For 
5 the host plant cell useful in carrying out the steroid 
compound biosynthesis according to the invention, the 
minimum that is required is the coding nucleic acids be in 
the same host cell. As long as the HMGR and a steroid 
pathway enzyme coding sequences are present in the same host 

10 cell, they do not have to be on the same DNA molecule or 

under the control of the same promoter, nor do they have to 
be derived from the same vector or construct . 

Host cells are useful for making, storing, reproducing 
or manipulating nucleic acid constructs of the invention. 

15 Contemplated host cells are eukaryotic cells, such as yeast 
or plant cells. Any plant cells can be utilized with the 
present invention. Some particularly useful agriculturally 
significant plant cells are canola, soybean, corn, maize, 
tobacco, cotton, rape, tomato and alfalfa. Other common 

20 plant varieties are carrot, barley, arabidopsis, guayule and 
petunia. Prokaryotic host cells containing constrxicts 
and/or vectors according to the invention are also 
contemplated (e.g. E. coli) . 

One embodiment of the invention is a transformed host 

25 cell containing inter alia a recombinant construct that 
encodes both a DNA sequence encoding a polypeptide 
exhibiting 3 -hydroxy-3 -methylglutaryl-Coenzyme A reductase 
enzyme activity and a DNA sequence encoding a steroid 
pathway enzyme, where the steroid pathway enzyme is as 

30 described in detail above. In a preferred enibodiment, those 
coding DNA sequences are operably linked to a promoter and a 
transcription termination signal sequence. In the coding 
sense direction of the construct, the components of the 
construct are operably linked in the 5 * to 3 ' direction as a 
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promoter, the DNA sequence encoding secjuence and a 
transcription termination signal sequence. 

Another embodiment of the invention is host cell that 
has been transformed with a recombinant vector that has such 
5 a construct. As discussed herein, in one embodiment of the 
invention, such a recombinant vector is a plant expression 
vector. Preferably such a host cell is a plant cell. 

Methods of culturing various eukaryotic and prokaryotic 
cell cultures are well known in the art. The present 
10 invention contemplates cell cultures of transformed host 
cells . Transformed plant cells include transformed 
protoplasts and other types of host cell intermediates as 
well as plant cell cultures. 

Non-limiting examples of plants that can be used in the 
15 practice of the invention include, acacia, alfalfa, aneth, 
apple, apricot, artichoke, arugula, asparagus, avocado, 
banana, barley, beans, beet, blackberry, blueberry, 
broccoli, brussel sprouts, cabbage, canola, cantaloupe, 
carrot, cassava, cauliflower, celery, cherry, chicory, 
20 cilantro, citrus, Clementines, coffee, corn, cotton, 
cucumber, Douglas fir, eggplant, endive, escarole, 
eucalyptus, fennel, figs, garlic, gourd, grape, grapefruit, 
honey dew, jicama, kiwifruit, lettuce, leeks, lemon, lime. 
Loblolly pine, mango, melon, mushroom, nectarine, nut, oat, 
25 oil palm, oil seed rape, okra, onion, orange, ornamental 
plants, papaya, parsley, pea, peach, peanut, pear, pepper, 
persimmon, pine, pineapple, plantain, plum, pomegranate, 
poplar, potato, pumpkin, quince, radiata pine, radicchio, 
radish, raspberry, rice, rye, sorghum. Southern pine, 
30 soybean, spinach, squash, strawberiry, sugarbeet, sugarcane, 
sunflower, sweet potato, sweetgum, tangerine, tea, tobacco, 
tomato, triticale, turf, turnip/ watermelon, wheat, yam, and 
zucchini . 
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Plants particularly attractive for the steroid pathway/ 
modifications described herein include those that produce 
carbon substrates which can be employed for synthesis of 
these compounds. Non-limiting examples of such plants 
5 include various monocots and dicots, including high oil seed 
plants such as high oil seed Brassica (e.g., Brassica nigira, 
Brass ica napus, Brassica hirta, Brassica rapa, Brassica 
campestris, Brassica carinata, and Brassica juncea) , soybean 
(Glycine max) , castor bean {Ricinus communis) , cotton, 

10 saf flower iCarthamus tinctorius) , sunflower {Helianthus 
annuUs) , flax {Linum usitatissimixm) , corn (Zea mays) , 
coconut {Cocos nucifera) , palm {Elaeis guineensis) , oilnut. 
trees such as olive (Olea europaea) , sesame, and peanut 
(Arachis hypogaea) , as well as Arabidopsis, tobacco, wheat, 

15 barley, oats, amaranth, potato, rice, tomato, and legumes 
(e.g., peas, beans, lentils, alfalfa, etc.). 

Enhancement of sitostanol compound production by the 
methods discussed herein is expected to result in yields of 
sitostanol, sitostanol esters, or mixtures thereof in an 

20 amount of at least about 57% by weight, preferably from 

about 57% to about 90% by weight, and more preferably from 
about 57% to about 65% by weight of the total sterol 
compounds present in seed oil . Expressed on a seed dry 
weight basis, sitostanol, sitostanol esters, or mixtures 

25 thereof are expected to be present in an amount of at least 
about 0.08%, preferably from about 0.08% to about 0-8%, and 
more preferably from about 0.08% to about 0.4% of seed dry 
weight . 

B . Processes of Transformation 
30 A variety of different methods can be employed to 

introduce trans format ion/ express ion vectors into plant 
protoplasts, cells, callus tissue, leaf discs, meristems, 
etc., to generate transgenic plants. These methods include, 
for example, Ag-rojbacterium-mediated transformation, particle 
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gun delivery, microinjection, electroporation, polyethylene 
glycol -mediated protoplast transformation, liposome-mediated 
transformation, etc. (reviewed in Potrykus (1991) Ann. Rev. 
PlsLnt Physiol. Plant Mol . Biol. 42: 205). 
5 In general, transgenic plants comprising cells 

containing and expressing nucleic acids encoding enzymes 
facilitating the modifications in sterol compound and 
tocopherol biosynthesis and accumulation described herein 
can be produced by transforming plant cells with a DNA 

10 construct as described above via any of the foregoing 

methods; selecting plant cells that have been transformed on 
a selective medium; regenerating plant cells that have been 
transformed to produce differentiated plants; and selecting 
a transformed plant that expresses the enzyme -encoding 

15 nucleotide sequence (s) at a level such that the amount of 
sitosterol, sitosterol esters, sitostanol, sitostanol 
esters, tocopherol compound (s), and campesterol/campestanol 
and their esters is within the ranges described herein. 

The encoding DNAs can be introduced either in a single 

20 transformation event (all necessary DNAs present on the same 
vector) , a co- trans format ion event (all necessary DNAs 
present on separate vectors that are introduced into plants 
or plant cells simultaneously) , or by independent 
transformation events (all necessary DNAs present on 

25 separate vectors that are introduced into plants or plant 
cells independently) . Traditional breeding methods can 
subsequently be used to incorporate the desired combination 
of enzymes into a single plant, and to produce hybrid 
progeny of the invention plants. 

30 Specific methods for transforming a wide variety of 

dicots and obtaining transgenic plants are well documented 

in the literature (Gasser and Fraley (1989) Science 244: 

12 93; Fisk and Dandekar. (1993) Scientia, Horticul turae 55: 5; 
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Christou (1994) Agro Food Industry Hi TGch, p. 17; and the 
references cited therein) . 

Apomixis is a genetically controlled method , of 
reproduction in plants where the einbryo is formed without 
5 union of an egg and a sperm. There are three basic types of 
apomictic reproduction: 1) apospory where the embryo 
develops from a chromosomal ly unreduced egg in an embryo sac 
derived from the nucellus, 2) diplospory where the embryo 
develops from an unreduced egg in an embryo sac derived from 

10 the megaspore mother cell, and 3) adventitious embryony 

where the embryo develops directly from a somatic cell. In 
most forms of apomixis, psuedogamy or fertilization of the 
polar nuclei to produce endosperm is necessary for seed 
viability. In apospory, a "nurse" cultivar can be used as a 

15 pollen source for endosperm formation in seeds. The nurse 
cultivar does not affect the genetics of the aposporous 
apomictic cultivar since the unreduced egg of the cultivar 
develops parthenogenetically, but makes possible endosperm 
production. 

20 Apomixis is economically important, especially in 

transgenic plants, because it causes any genotype, no matter 
how heterozygous/ to breed true. Thus, with apomictic 
reproduction, heterozygous transgenic plants can maintain 
their genetic fidelity throughout repeated life cycles. 

25 Methods for the production of apomictic plants are known in 
the art. See, U.S. Patent No. 5,811,636 and references 
cited therein which are herein incorporated by reference. 

The present invention also encompasses uniform 
populations of any of the plants discussed herein. 

30 Successful transformation and plant regeneration have 

been achieved in the monocots as follows: asparagus 
(Asparagus officinalis} Bytebier et al . (1987) Proc. Natl. 
Acad. Sci. USA 84: 5345); barley {Hordeum vulgarae; Wan and 
Lemaux (1994) Plant Physiol. 104: 37); maize (Zea mays; 
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Rhodes et al, (1988) Science 240: 204; Gordon-Kamm et al • 

(1990) Plant Cell 2: 603; Fromm et al . (1990) Bio/Technology 
8: 833; Koziel et al . (1993) Bio/Technology 11 : 194); oats 
(Averza sativa; Somers et al . (1992) Bio/Technology 10: 

5 1589); orchardgrass {Dactylis glomerata; Horn et al • (1988) 
Plant Cell Rep. 7: 469); rice {Oryza sativa, including 
indica and japonica varieties; Toriyama et al . (1988) 
Bio/Technology 6: 10; Zhang et al . (1988) Plant Cell Rep. 7: 
3 79; Luo and Wu (1988) Plajit Mol . Biol. Rep. 6: 165; Zhang 
10 and Wu (1988) Theor. Appl . Genet. 76: 835; Christou et al . 

(1991) Bio/Technology 9: 957); rye {Secale cerealej De la 
Pena et al . (1987) Nature 325: 274); sorghum {Sorghum 
bicolor; Cassas et al . (1993) Proc. Natl. Acad. Sci . USA 90: 
11212); sugar cane {Saccharum spp.; Bower and Birch (1992) 

15 Plant tj. 2: 409); tall fescue (Festuca airundinacea ; Wang et 
al . (1992) Bio/Technology 10: 691); turfgrass {Agrastis 
palustrisj Zhong et al . (1993) Plant Cell Rep. 13: 1); and 
wheat {Triticum aestivum; Vasil et al . (1992) Bio/Technology 
10: 667; Weeks et al . (1993) Plant Physiol. 102: 1077; 

20 Becker et al . (1994) Plant J. 5: 299), 

Plant transformation vectors capable of delivering DNAs 
(genomic DNAs, plasmid DNAs, cDNAs, or synthetic DNAs) 
encoding plant -derived or other enzymes that affect the 
biosynthesis and accumulation of sterol compounds and 

25 tocopherols in plants for optimizing the pools of 

sitosterol, sitostanol, esters of either, and tocopherols, 
and for reducing the levels of campesterol, campestanol, 
and/or their esters, can be easily designed by art- 
recognized methods. Various strategies can be employed to 

30 introduce these encoding DNAs into plants to produce 
transgenic plants that biosynthesize and accumulate 
desirable levels of various sterol compounds and 
tocopherols , including : 
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1 . Transforming individual plants with an encoding DNA 
of interest. Two or more transgenic plants, each containing 
one of these DNAs, can then be grown and cross -pollinated so 
as to produce hybrid plants containing the two DNAs. The 

5 hybrid can then be crossed with the remaining transgenic 

plants in order to obtain a hybrid plant containing all DNAs 
of interest within its genome. 

2. Sequentially transforming plants with plasmids 
containing each of the encoding DNAs of interest, 

10 respectively. 

3 . Simultaneously cotrans forming plants with plasmids 
containing each of the encoding DNAs, respectively. 

4 - Transforming plants with a single plasmid 
containing two or more encoding DNAs of interest . 
15 5. Transforming plants by a combination of any of the 

foregoing techniques in order to obtain a plant that 
expresses a desired combination of encoding DNAs of 
interest . 

Traditional breeding of transformed plants produced 
20 according to any one of the foregoing methods by successive 
rounds of crossing can then be carried out to incorporate 
all the desired encoding DNAs in a single homozygous plant 
line (Nawrath et al . (1994) Proc. Natl. Acad. Sci . USA 91: 
12760; PCT International Publication WO 93/02187), or to 
25 produce hybrid offspring- 

In methods 2 and 3 , the use of vectors containing 
different selectable marker genes to facilitate selection of 
plants containing two or more different encoding DNAs is 
advantageous. Examples of useful selectable marker genes 
30 include those conferring resistance to kanamycin, 

hygromycin, sulphonamides, glyphosate, bialaphos, and 
phosphinothricin . 
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C . Processes of Regeneration 

Processes of regeneration of plants from transformed 
protoplasts are known in the art. 



5 D . Transgenic Plants and Progeny 

The present invention contemplates the plants that 
contain the exogenous constructs according to the present 
invention, such that a plant comprises at least one 
transformed plant cell comprising a nucleic acid construct. 

10 The nucleic acid construct, as described above has as 
operably linked components in the 5' to 3 ' direction, a 
promoter, a DNA sequence encoding a polypeptide exhibiting 
3 -hydroxy- 3 -methylglutaryl -Coenzyme A reductase enzyme 
activity, and a transcription termination signal sequence. 

15 The plant also comprises a nucleic acid construct that has 
as operably linked components in the 5' to 3 ■ direction, a 
promoter, a DNA sequence encoding a steroid pathway enzyme, 
and a transcription termination signal sequence. The 
steroid pathway enzyme is a squalene epoxidase enzyme, a 

20 sterol methyl transferase I enzyme, a sterol C4-demethylase 
enzyme, a obtusifoliol C14a-demethylase enzyme, a sterol C5- 
desaturase enzyme, or a sterol methyl transferase II enzyme. 
In one embodiment, the nucleic acid constructs are 
recombinant constructs . 

25 In one embodiment of the present invention a transgenic 

plant can be produced in accordance with the processes 
discussed elsewhere herein. One method to arrive at the 
above construct -containing plant is to transform the plant 
cell with a recombinant vector harboring such a construct. 

30 Other methods involve direct transfer of the exogenous 

construct into the plant cell. The methods of arriving at a 
plant cell having exogenous nucleic acids are well known in 
art and are applicable to the present invention. In one 
embodiment, the nucleic acid constructs are recombinant 
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constructs. In a preferred embodiment, the recombinant 
vector is a plant expression vector. 

The present invention contemplates a plant, the genome 
of which comprises introduced DNA. That introduced DNA has 
5 at least two components. One component is a DNA encoding a 
3 -hydroxy- 3 - me thyulglutaryl -Coenzyme A reductase enzyme. 
The other component is DNA encoding a steroid pathway enzyme 
that is a squalene epoxidase enzyme, a sterol, methyl 
transferase I enzyme, a sterol C4-demethylase enzyme, a 

10 obtusifoliol C14a-demethylase enzyme, a sterol C5-desaturase 
enzyme, or a sterol methyl transferase II enzyme. The 
storage organs, preferably seeds, of such a plant contain an 
elevated level of total accumulated sterol, compared to 
storage organs of an otherwise identical plant, the genome 

15 of which does not comprise the introduced DNA. The 
introduced DNAs are operatively linked to regulatory 
signals, preferably that cause seed- specif ic expression of 
said introduced DNAs. The seeds of such a plant contain a 
reduced level of squalene, cycloartenol , 24 -methylene 

20 cycloartenol, obtusifoliol, stigmasta-7-enol , or campesterol 
compared to the seeds of an otherwise identical plant or 
compared to a plant comprising an introduced DNA encoding a 
HMG CoA reductase enzyme without the contemplated steroid 
pathway enzyme . 

25 Also contemplated is a plant with introduced DNA, as 

described above, that produces seed having an elevated level 
of a steroid pathway product, compared to a corresponding 
transgenic or non- transgenic plant that does not contain 
said introduced DNA. 

30 The invention also contemplates a plant comprising 

introduced DNA encoding (i) an HMGR enzyme and (ii) a 
squalene epoxidase enzyme, a sterol methyl transferase I 
enzyme, a sterol C4-demethylase enzyme, a obtusifoliol C14a- 
demethylase enzyme, a sterol C5-desaturase enzyme, a sterol 
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methyl transferase II enzyme, or mixtures thereof, wherein 
said plant that produces a storage organ (preferably a seed) 
having an elevated level of a sterol pathway product 
compared to a corresponding transgenic or non- transgenic 
5 plant that does not contain said introduced DNA. 

The invention also contemplates a plant having 
introduced DNA, as described above, that produces a storage 
organ (preferably a seed) having a reduced level of 
squalene, cycloartenol , 24 -methylene cycloartenol , 

10 obtusif oliol, stigmasta-7-enol , campesterol, or mixtures 
thereof, compared to a corresponding transgenic plant that 
comprises introduced DNA encoding an HMGR enzyme but that 
does not contain introduced DNA encoding a squalene 
epoxidase enzyme, a sterol methyl transferase I enzyme; a 

15 sterol C4-demethylase enzyme, a obtusif oliol C14a- 

demethylase enzyme, a sterol C5>desaturase enzyme, a sterol 
methyl transferase II enzyme, or mixtures thereof. 

For any of the above plants, an embodiment is 
contemplated wherein the introduced nucleic acid has 

20 regulatory signals that cause seed-specific expression of 
said introduced DNAs . 

The progeny of the above -described plants are also 
considered an embodiment of the present invention, as are 
plant cells or transformed plant cells. Cultures of those 

25 plant cells are also contemplated. Plants produced from 
seeds having introduced DNA are also embodiments of the 
present invention, 

A further embodiment of the present invention is a 
method of producing a plant that accumulates an elevated 

30 level of sterol pathway products, in seed of said plant 
compared to seed of a corresponding plant comprising no 
introduced DNA encoding a peptide, polypeptide, or protein 
that affects the biosynthesis and accumulation of a sterol 
pathway product, comprising sexually crossing a plant having 
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introduced nucleic acid with the corresponding plant having 
no introduced DNA. Plants, including apomictic plants 
produced by this method are contemplated. 

Another embodiment is a seed resulting from a cross of 
5 a plant having introduced DNA, described above, with a nurse 
cultivar. Also contemplated are seeds of any of the above- 
described plants. Also part of the present invention plant 
parts, other than a seed of any of the above -described 
plants . 

10 Uniform populations of the above -described plants are 

also contemplated. 

E . Stability of Transgene Expression 

As several overexpressed enzymes may be required to 

15 produce optimal levels of substrates for the biosynthesis of 
sterol compounds and tocopherols, the phenomenon of co- 
suppression may influence transgene expression in 
transformed plants. Several strategies can be employed to 
avoid this potential problem (Finnegan and McElroy (1994) 

20 Bio/Technology 12: 883) . 

One commonly employed approach is to select and/or 
screen for transgenic plants that contain a single intact 
copy of the transgene or other encoding DNA (Assaad et al . 
(1993) Plant Mol . Biol. 22: 1067; Vaucheret (1993) C.R. 

25 Acad. Sci. Paris, Science de la vie/Life Sciences 316: 1471; 
McElroy and Brettell (1994) TIBTECH 12: 62) . 
AgrroJbacteriu/n-mediated transformation technologies are 
preferred in this regard. 

Inclusion of nuclear scaffold or matrix attachment 

30 regions (MAR) flanking a transgene has been shown to 

increase the level and reduce the variability associated 
with transgene expression in plants (Stief et al . (1989) 
Nature 341: 343; Breyne et al . (1992) Plant Cell 4: 463; 
Allen et al . (1993) Plant Cell 5: 603); Mlynarova et al . 
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Physiol. 110: 15) . Flanking a transgene or other encoding 
DNA with MAR elements may overcome problems associated with 
differential base composition between such transgenes or 
5 encoding DNAs and integrations sites, and/or the detrimental 
effects of sequences adjacent to transgene integration 
sites . 

The use of enhancers from tissue-specific or 
developmentally-regulated genes may ensure that expression 
10 of a linked transgene or other encoding DNA occurs in the 
appropriately regulated manner. 

The use of different combinations of promoters, plastid 
targeting sequences, and selectable markers for introduced 
transgenes or other encoding DNAs can avoid potential 
15 problems due to trans- inacti vat ion in cases where pyramiding 
of different transgenes within a single plant is desired . 

Finally, inactivation by co- suppression can be avoided 
by screening a number of independent transgenic plants to 
identify those that consistently overexpress particular 
20 introduced encoding DNAs (Register et al . (1994) Plant Mol . 
Bial. 25: 951). Site-specific recombination in which the 
endogenous copy of a gene is replaced by the same gene, but 
with altered expression characteristics, should obviate this 
problem (Yoder and Goldsbrough (1994) Bio/Technology 12: 
25 263) . 

Any of the foregoing methods, alone or in combination, 
can be employed in order to insure the stability of 
transgene expression in transgenic plants of the present 
invention. 

30 

F. Hybrid Plants 

The invention contemplates a plant having introduced 
DNA encoding an HMGR and at least one of the six steroid 
pathway enzymes, as described in detail above. It is 
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contemplated that a transgenic plant having UNA encoding an 
HMGR, as is known in the art, might be crossed with a 
transgenic plant having DNA encoding at least one of the six 
steroid pathway enzymes . 
5 Also contemplated as a hybrid plant according to the 

invention is a plant that is a hybrid of a t transgenic plant 
having introduced DNA encoding an HMGR and at least one of 
the six steroid pathway enzymes wherein the plant has been 
hybridized with another strain, yet still retains the 
10 introduced DNA- 

G . Storage Orcrans 

The term "storage organ" as used herein, refers to the 
seeds, fruits or vegetable parts of a plant. Most often the 
15 seed is important for use in the present invention. 

However, there are consumable embodiments, such as with 
potatoes or carrots, where the vegetable parts are 
preferred. 

A contemplated embodiment of the present invention is a 
20 storage organ comprising at least one transfoanned host cell. 
The transformed host cell has at a minimum a constaruct 
according the invention as described above. Also 
contemplated are the embodiments when the construct has 
plant promoters, when the construct is recombinant, when the 
25 construct is part of a vector, and when the vector is a 
plant expression vector. 

The invention contemplates a transgenic plant seed 
transformed with a vector comprising a DNA segment that 
encodes a polypeptide having HMG-CoA reductase activity, and 
30 a DNA segment that encodes a polypeptide having a steroid 

pathway enzyme, wherein the transgenic plant seed is capable 
of germinating into a transgenic plant that over-accumulates- 
steroid pathway products relative to a native, non- 
transgenic plant of the same strain; and mutants, 
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recombinants, genetically engineered derivatives thereof and 
hybrids derived therefrom, wherein said mutants, 
recombinants, genetically engineer derivatives thereof and 
hybrids derived therefrom maintain the ability to 
5 overaccumulate steroid pathway products. 

Seed from a transgenic plant is grown in the field or 
greenhouse, and resulting sexually mature transgenic plants 
are self -pollinated to generate true breeding plants. The 
progeny from these plants become true breeding lines that 
10 are evaluated for steroid compound or squalene accumulation, 
preferably in the field under a range of environmental 
conditions . 

The commercial value of a transgenic plant with 
increased steroid compound or squalene accumulation is 

15 enhanced if many different hybrid combinations are available 
for sale. The user typically grows more than one kind of 
hybrid based on such differences as time to maturity, 
standability or other agronomic traits. Additionally, 
hybrids adapted to one part of a country are not necessarily 

20 adapted to another part because of differences in such 
traits as maturity, disease and herbicide resistance. 
Because of this, steroid compound or squalene accumulation 
is preferably bred into a large number of parental lines so 
that many hybrid combinations can be produced. 

25 Adding an enhanced steroid compound or squalene 

accumulation trait to an agronomically elite line is 
accomplished by a variety of techniques well known to those 
of skill in the art. For example, parent transgenic plants 
that are either homozygous or contain a single independent 

30 segregatable gene that encodes a polypeptide having HMG-CoA 
activity and thus for enhanced sterol or squalene 
accumulation are crossed with lines having other desirable 
traits such as herbicide resistance (U.S. Pat. No. 
4,761,373) produce hybrids. Preferably, transgenic plants 
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homozygous for enhanced sterol or squalene accumulation are 
used to generate hybrids. 

For example, a transgenic plant homozygous for enhance 
sterol accumulation is crossed with a parent plant having 

5 other desired traits. The progeny, which are heterozygous 
or independently segregatable for enhanced sterol 
accumulation and their other desired traits. The 
backcrossing of progeny with the parent may have to be 
repeated more than once to obtain a transgenic plant that 

0 possesses all desirable traits. 

Alternatively, transgenic plants with an enhanced 
sterol or squalene accumulation trait are made multiply 
transgenic by introducing into such plants other genes that 
encode and express other desirable traits, or are mutated as 

5 with radiation, e.g. X-rays or gamma rays, as in U.S. Pat. 
No. 4,616,099, whose disclosures are incorporated by 
reference. Thus, the present invention also contemplates 
mutants and genetically engineered derivatives of transgenic 
plants having enhanced sterol or squalene accumulation. 

3 VI . Harvest 

Besides seed, elevated levels of sterols, phytosterols, 
such as sitosterol, phytostanols , such as sitostanol, and 
esters thereof, can be found in other parts of the plants 
encompassed herein. While the seed- specif ic promoters 

5 contemplated in the present invention function 

preferentially in seed tissues, expression in other plant 
parts can be expected, depending upon the specificity of the 
particular promoter. In this case, promoters functional in 
plant plastids are less desirable than those primarily 

3 directing expression in the cellular cytosol, though it may 
be desirable to use promoters to drive expression of the 
recombinant constructs or expression cassettes disclosed 
herein in tissues and organs other than seeds. For example, 
elevated levels of sterols, phytosterols, etc., can be 
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expected in fruits, as well as vegetable parts of plants 
other than seeds. Vegetable parts of plants include, for 
example, pollen, inflorescences, terminal buds, lateral 
buds, stems, leaves, tubers, and roots. Thus, the present 
5 invention also encompasses these and other parts of the 
plants disclosed herein that contain elevated levels of 
desirable phytosterol, and phytostanol . 

Of course, a significant effect of introducing into 
plants the coding sequences disclosed herein will be on the 
10 content of phytosterols/phytostanols and their esters of 
seed oil. Therefore, additional aspects of the present 
invention include oil obtainable from the seed of the plants 
described herein, and methods for producing such plants and 
oil . Methods for extracting and processing seed oils are 
15 well known in the art . 

Oils produced by the cells, plants, and methods 
disclosed herein are superior in phytosterol /phytostanol 
composition to conventional oils in a variety of ways. Oil 
of the present invention can contain an elevated level of at 
20 least one sterol, at least one phytosterol, at least one 
phytosterol ester, at least one phytostanol, at least one 
phytostanol ester, or mixtures thereof. Preferred compounds 
include sitosterol, sitostanol, and their esters. 

Oil from seed of plants containing and expressing 
25 introduced DNA encoding a sterol methyl transferase II enzyme 
advantageously contains a reduced level of campesterol, at 
least one campesterol ester, campestanol , at least one 
campestanol ester, or mixtures thereof. The sterol 
methyl transferase II -encoding DNA can be introduced alone, 
30 or in combination with other introduced DNA sequences 
encoding enzymes affecting the biosynthesis of steroid 
compounds as discussed herein. Campesterol /campestanol and 
their esters are considered to be undesirable because they 
are readily absorbed in the intestine, while their safety in 
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the blood is unknown. Employing the plants and methods 
disclosed herein, one can obtain seed oil comprising about 
0% to about 19%, preferably about 0% to about 12%/ more 
preferably about 5% to about 9% campesterol, at least one 
5 campesterol ester, campestanol, at least one campestanol 
ester, or mixtures thereof by weight of the total sterol 
compounds of the oil. (The levels of these compounds are 
difficult to express on a percent seed dry weight basis 
because different seeds contain different percentages of 
10 these compounds expressed on this basis) These values 
represent a reduction of about 10% to about 100% in the 
amount of these compounds compared to those in conventional 
oils . 

Introduction into plant cells of the enzyme -encoding 
15 DNA sequences discussed above modifies the biosynthesis of 
sterol compounds carried out by the methods, and in the 
cells, plants, and. seeds, disclosed herein. In particular, 
the expression of an HMG CoA reductase in conjunction with 
DNA sequences for a steroid pathway enzyme is expected to 
20 result in alteration of the steroid pathway product profiles 
in oil as the enhanced steroid pathway throughput produces 
substrates for the enhanced enzyme activity. The novel 
phytostanol ester compositions, e.g., sitostanol ester 
compositions, thus produced constitute another aspect of the 
25 present invention. 

A. Harvest of Steroid Compounds 

Methods for the derivation of steroid compounds from 
cells are well known in the art. The invention contemplates 
30 the recovery of biosynthesized steroid compounds from the 
leaves and/or stems of plants, from plant seeds, from 
plant's vegetative organs, from callouses, and from cell 
cultures of plants, yeasts or eukaryotic cells, 
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Different sources of steroid compounds are preferred 
for various plants. For use as a food or a food component 
as discussed later, the steroid compounds need not be 
isolated or purified to 100 percent purity. Steroid 
5 compound -enriched plants may be utilized directly. 

For example, from tobacco or Ara^bidopsis , it may be 
preferable to extract a pulp of the leaves and stems. From 
tomato, potato, or corn, it may be preferable to use the 
tomato, potato or corn in the foirm of the familiar storage 
10 organs that are typically consumed either directly, or a 
derivative thereof, such as tomato paste, potato flakes, 
vegetable oil and many more that are well known in the food 
science arts. 

If desired, after cultivation, the transgenic plant is 
15 harvested to recover the sterol or squalene product . This 
harvesting step can consist of harvesting a callus culture, 
the entire plant, or only the leaves, or roots of the plant. 
This step can either kill the plant or, if only a non- 
essential portion of the transgenic plant is harvested, can 
20 permit the remainder of the plant to continue to grow. 

In preferred embodiments, this harvesting step further 
comprises the steps of: 



(i) homogenizing at least a sterol -containing 
25 or a squalene -containing portion of the 

transgenic plant to produce a plant pulp and 
using the sterol -or squalene-containing pulp 
directly, as in dried pellets or tablets as 
where an animal food is contemplated; or 

30 

(ii) extracting the squalene or sterol (s) 
from the plant pulp with an appropriate 
solvent such as an organic solvent or by 
supercritical extraction [Favati, et al . , J". 
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Food Sci. 53:1532 (1988) and the citations 
therein] to produce a sterol -or squalene- 
containing liquid solution or suspension; and 

5 (iii) isolating the squalene or sterol (s) 

from the solution or suspension. 

At least a portion of the transgenic plant is 
homogenized to produce a plant pulp using methods well known 

10 to one skilled in the art. This homogenization can be done 
manually, by a machine, or by a chemical means as long as 
the transgenic plant portions are broken up into small 
pieces to produce a plant pulp. This plant pulp consists of 
a mixture of squalene or the sterol of interest, residual 

15 amounts of precursors, cellular particles and cytosol 
contents . This pulp can be dried and compressed into 
pellets or tablets and eaten or otherwise used to derive the 
benefits, or the pulp can be subjected to extraction 
procedures . 

20 The sterol or squalene can be extracted from the plant 

pulp produced above to form a sterol -or- squalene-containing 
solution or suspension. Such extraction processes are 
common and well known to one skilled in this art. For 
example, the extracting step can consist of soaking or 

25 immersing the plant pulp in a suitable solvent. This 

suitable solvent is capable of dissolving or suspending the 
squalene or sterol present in the plant pulp to produce a 
sterol -or squalene-containing solution or suspension. 
Solvents useful for such an extraction process are well 

30 known to those skilled in the art and included several 

organic solvents and combinations thereof such as methanol, 
ethanol , isopropanol , acetone , acetonitrile , tetrahydrof uran 
(THF) , hexane, and chloroform as well as water-organic 
solvent mixtures. A vegetable oil such as peanut, corn. 



02061 072A2J_> 



wo 02/061072 



PCT/US02/00255 



106 

soybean and similar oils can also be used for this 
extraction as can steam distillation. 

A whole plant or callus culture with an added, 
exogenous structural gene for a polypeptide having HMG-CoA 
5 reductase activity is grown under suitable conditions for a 
period of time sufficient for squalene or sterols to be 
synthesized and accumulated. The sterol -squalene-containing 
plant cells, preferably in dried form, are then lysed 
chemically or mechanically, and the squalene or sterol is 

10 extracted from the lysed cells using a liquid organic 

solvent or steam distillation, as described before, to form 
a sterol- or squalene-containing liquid solution or 
suspension by usual means such as chromatography. 

The squalene or sterol is isolated from the solution or 

15 suspension produced above using methods that are well known 
to those skilled in the art of squalene and sterol 
isolation. These methods include, but are not limited to, 
purification procedures based on solubility in various 
liquid media, chromatographic techniques such as column 

20 chromatography and the like. 

The invention contemplates a sitosterol or sitostanol 
ester composition extracted from the seed of a transgenic 
plant of the invention. The invention also contemplates 
such a sitosterol or sitostanol ester wherein an esterifying 

25 fatty acid has 2 to 22 carbon atoms in the main chain. 

B . Harvest of Oil 

The novel biosynthetic composition of the oil in the 
transgenic plants is contemplated. Thus, the present 
30 invention contemplates oil containing at least one sterol 
pathway product, extracted from seed of a described 
transgenic plant. Preferably, sitosterol, at least one 
sitosterol ester, or mixtures thereof, comprise at least 
about 50% by weight of the total sterol compounds of the 
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oil. Preferably, sitosterol, at least one sitosterol ester, 
or mixtures thereof, comprise at least about 0.08% of the 
dry weight of a contemplated seed. Preferably, the oil has 
a reduced amount of squalene, cycloartenol , 24 -methylene 
5 cycloartenol, obtusif oliol , stigmasta-7-enol , campesterol, 
or mixtures thereof, compared to oil from a corresponding 
transgenic plant that does not contain introduced DNA 
encoding a squalene epoxidase enzyme, a sterol methyl 
transferase I enzyme, a sterol C4-demethylase enzyme, a 

10 obtusif oliol C14a-demethylase enzyme, a sterol C5-desaturase 
enzyme, a sterol methyl transferase II enzyme, or mixture 
thereof, and that reduction is in the range of from about 
10% to about 100%. 

Oil is extracted from transgenic plant seeds of the 

15 present invention by method well known in the art. By way 
of example, oil can be extracted from plant seeds using 
extraction methods set forth above for harvesting sterols 
and squalene from transgenic plants. Alternatively, oil can 
be extracted from transgenic plant seeds by usually used 

20 methods for obtaining seed oils such as by crushing he seeds 
to produced a pulp and then pressing the pulp to obtain oil. 
The pulp can also be extracted with appropriate solvents 
(e.g. benzene) to obtain the oil. Industrial ChemistJry: A 
Manual for the Student and Manufacturer, ed. By A. Rogers 

25 and A. B. Aubert, D. Van Nostrand Co., New York, pages 547- 
548 (1912) . 

C. Uses of Oil 

As discussed in the "Description of Related Art," 
30 phytostanols such as sitostanol are beneficial for lowering 
serum cholesterol (Ling et al - (1995) Life Sciences 57: 195- 
206) and preventing cardiac disease. Tocopherols act as 
antioxidants, and play a major role in protecting cells from 
damage caused by free radicals (Halliwell (1997) Nutrition 
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Review 55: 44-60). As the amount of sitostanol in 
conventional vegetable and bran oils is low relative to that 
of other sterol compounds, the oils of the present invention 
are particularly useful for reducing the concentration of 
5 low density lipoprotein cholesterol in plasma. 

Thus, further aspects of the present invention include 
the fol lowing : 

Cholesterol -lowering compositions comprising the oils 
and sitostanol ester compositions disclosed herein. Such 

10 cholesterol -lowering compositions can take the form of, or 
be used in, foods, food products, processed foods, food 
ingredients, food additive compositions, or dietary 
supplements that contain oils and/or fats. Non-limiting 
examples include margarines; butters; shortenings; cooking 

15 oils; frying oils; dressings, such as salad dressings; 
spreads; mayonnaises; and vitamin/mineral supplements. 
Patent documents relating to such compositions include U.S. 
Patents 4,588,717 and 5,244,887, and PCT International 
Publication Nos . WO 96/38047, WO 97/42830, WO 98/06405, and 

20 WO 98/06714- Additional non-limiting examples include 
toppings; dairy products such as cheese and processed 
cheese; processed meat; pastas; sauces; cereals; desserts, 
including frozen and shelf -stable desserts; dips; chips; 
baked goods; pastries; cookies; snack bars; confections; 

25 chocolates; beverages; unextracted seed; and unextracted 
seed that has been ground, cracked, milled, rolled, 
extruded, pelleted, defatted, dehydrated, or otherwise 
processed, but which still contains the oils, etc., 
disclosed herein. 

30 Food additive compositions of the present invention can 

be made by a method comprising obtaining oil containing a 
phytostanol or phytostanol ester selected from sitostanol, 
at least one sitostanol ester, or mixtures thereof, from 
cultured cells, or seeds of a plant, of the present 
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invention, and evenly distributing the oil or desired 
phytostanol compound in finely divided form throughout the 
food product or food additive composition to which it is 
added by dissolution or by suspension in an emulsion. For 
5 example / the oil or phytostanol compound can be dissolved in 
an edible solubilizing agent, or can be mixec^ with an edible 
solubilizing agent, an effective amount of a dispersant, and 
optionally, an effective amount of an antioxidant. Examples 
of useful edible solubilizing agents include, but are not 

10 limited to, monoglycerides, diglycerides, triglycerides, 

vegetable oils, tocopherols, alcohols, polyols, or mixtures 
thereof. Examples of useful antioxidants include, but are 
not limited to, tocopherols, such as -tocopherol, ascorbic 
acid, inexpensive synthetic antioxidants, and mixtures 

15 thereof. Effective carriers for preparing emulsions or 

suspensions include water, alcohols, polyols, other edible 
compounds in which the oil or phytostanol compound is 
soluble or insoluble, and mixtures thereof. Examples of 
useful dispersants include, but are not limited to, 

20 lecithin, other phospholipids, sodium lauryl sulfate, fatty 
acids, salts of fatty acids, fatty acid esters, other 
detergent -like molecules, and mixtures thereof. 
Alternatively, the food additive composition can be made by 
a method comprising obtaining oil containing at least one 

25 tocopherol, and a phytostanol or phytostanol ester selected 
from sitostanol, at least one sitostanol ester, and mixtures 
thereof, from cultured cells, or seed of a plant, of the 
present invention, and mixing the oil with an edible 
solubilizing agent and an effective amount of a dispersant. 

30 Again, the edible solubilizing agent can include, but is not 
limited to, monoglycerides, diglycerides, triglycerides, 
vegetable oils, tocopherols, alcohols, polyols, or mixtures 
thereof, and the dispersant can include, but is not limited 
to, lecithin, other phospholipids, sodium lauryl sulfate, 
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fatty acids, salts of fatty acids, fatty acid esters, other 
detergent -like molecules, and mixtures thereof. 

The cholesterol -lowering compositions can also take the 
form of pharmaceutical compositions comprising a 
5 cholesterol -lowering effective amount of the oils or 

sitostanol ester compositions disclosed herein, along with a 
pharmaceutically acceptable carrier, excipient, or diluent. 
These pharmaceutical compositions can be in the form of a 
liquid or a solid. Liquids can be solutions or suspensions; 
10 solids can be in the form of a powder, a granule, a pill, a 
tablet, a gel, or an extrudate, U.S. Patent 5,270,041 
relates to sterol -containing pharmaceutical compositions. 

Any of the foregoing cholesterol -lowering compositions 
can be used alone or in combination in methods to lower the 
15 risk of developing an elevated plasma concentration of low 
density lipoprotein cholesterol, to lower the plasma 
concentration of low density lipoprotein cholesterol, or to 
treat or prevent an elevated plasma concentration of low 
density lipoprotein cholesterol. Such methods comprise 
20 orally administering to a human or animal subject an 

effective amount of cholesterol -lowering composition. What 
constitutes an effective amount of cholesterol -lowering 
composition can be determined empirically, and depends in 
part on a variety of factors, including the age, weight, 
25 sex, diet, general medical condition of the subject, and the 
severity of hypercholesterolemia. Subjects undergoing 
treatment with the cholesterol -lowering combinations 
disclosed herein can be monitored by routine measurement of 
serum cholesterol levels to determine the effectiveness of 
30 therapy. Continuous analysis of the data obtained in this 
way permits modification of the treatment regimen during 
therapy so that optimal effective amounts of the 
cholesterol -lowering compositions of this invention are 
administered, and so that the duration of treatment can be 
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determined as well . In this way, the treatment 
regimen/dosing schedule can be rationally modified over the 
course of treatment so as to achieve the lowest cholesterol - 
lowering effective amount of the present compositions which 
5 results in satisfactory anti-cholesterolemic effectiveness, 
and so that administration of these compositions is 
continued only so long as is necessary to successfully treat 
this condition. In general, an effective amount of a 
cholesterol -lowering composition of the present invention in 

10 the form of a phytostanol- or phytostanol ester- containing 
composition is in the range of from about 0.1 gm/day to 
about 4.5 gm/day. By way of example, a phytostanol ester 
composition, for example a sitostanol ester composition, can 
be administered in an amount in the range of from about 0 . 1 

15 gm/day to about 4.5 gm/day, preferably from about 1 gm/day 
to about 4.5 gm/day, more preferably from about 2 gm/day to 
about 4.5 gm/day. A phytostanol composition, for example a 
sitostanol composition, can be administered in an amount in 
the range of from about 0.1 gm/day to about 3 gm/day, 

20 preferably from about 1 gm/day to about 3 gm/day, more 
preferably from about 2 gm/day to about 3 gm/day. 

The cholesterol -lowering compositions of the present 
invention can be administered daily to patients in 
accordance with a number of different regimens. 

25 Fundamentally, these compositions should be administered in 
a cholesterol -lowering effective amount for a period of time 
effective to exert their anti -hypercholesterolemic 
preventing, reducing, or reversing action. Administration 
of the present cholesterol -lowering compositions should be 

30 continued until the hypercholesterolemic condition has been 
controlled or eliminated. 

Another method encompassed by the present invention is 
that of achieving or improving effective absorption of 
sitostanol into a host, comprising producing at least one 
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sitostanol ester by any of the methods disclosed herein, and 
administering this sitostanol ester to a host, which can be 
a human or animal . The sitostanol ester can be administered 
by a route selected from oral route, parenteral route, or 
5 topical route. The dose, which can be administered daily, 
can be up to about 10 milligrams of the sitostanol ester per 
kilogram of body weight. U.S. Patent 5,2 02,045 relates to 
the use of stanol fatty acid esters to reduce serum 
cholesterol . 

10 Also envisioned are plants which in addition to having 

increased levels of phytosterols and phytostanols due to the 
presence of constructs comprising sequences encoding 3- 
hydroxy- 3 -me thylglutairyl -Coenzyme A reductase and at least 
one other sterol synthesis pathway enzyme, have increase 

15 levels of tocopherol due to the presence of constructs 

comprising sequences allowing the overexpression of enzymes 
in the tocopherol biosynthetic pathway. 

Tocopherol levels vary in different plants, tissues, 
and developmental stages, indicating a highly regulated 

20 biosynthetic pathway. The production of homogentisic acid 
by p-hydroxyphenyl pyruvate dioxygenase is likely to be a 
regulatory point for bulk flow through the pathway because 
of irreversible enzyme action and because homogentisic acid 
production is the first committed step in tocopherol 

25 biosynthesis (Norris et al , , 1995, Plant Cell 7: 2139-2149). 
The other key regulatoiry step in tocopherol biosynthesis is 
the availability of the phytylpyrophosphate pool. Feeding 
studies (Fuiruya et al . , 1987, Phytochem. , 26: 2741-2747) in 
saf flower callus culture demonstrated 1.8-fold and 18-fold 

30 increases in tocopherol synthesis by feeding homogentisate 
and phytol , respectively. In meadow rescue leaf, vitamin E 
increases in the initial phase of foliar senescence when 
phytol is cleaved off from the chlorophylls and when free 
phytol is available (Peskier et al . , 1989, J". Plant Physiol. 
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135: 428-432). These reports suggest tight coupling of 
tocopherol biosynthesis to the availability of homogentisic 
acid and phytol . 

Transformation of plants with nucleic acid constructs 
5 that increase the biosynthetic activity of the tocopherol 
pathway can lead to increased production of particular 
tocopherol isomers, for example, a- tocopherol , are known in 
the art and can be found, for example, in PCT International 
publication WO 00/61771 which is incorporated herein by 
10 reference. Formation of a-tocopherol from other tocopherols 
occurs due to S-adenosylmethionine (SAM) -dependent 
methylases such as y-tocopherol methyl transferase. 
Overexpression of methyl transferases in combination with 3- 
hydroxy-3-methylglutaryl -Coenzyme A reductase and at least 
15 one other sterol synthesis pathway enzyme as described 

herein is also contemplated in the present methods. Thus, 
any of the DNAs encoding enzymes of the tocopherol 
biosynthetic pathway, discussed above, are useful in the 
present invention. Transformation of plants with an early 
20 tocopherol biosynthesis gene is sufficient to produce plants 
having an elevated level of tocopherols. By "early 
tocopherol biosynthesis gene" is meant DNA encoding 
geranylgeranylpyrophosphate synthase , 
geranylgeranylpyrophosphate hydrogenase, 4- 
25 hydroxyphenyl pyruvate dioxygenase, and phytyl /prenyl 

transferase. DNA encoding enzymes active in later steps of 
tocopherol biosynthesis ("secondary tocopherol biosynthesis 
genes") can be expressed to enhance carbon flux through the 
tocopherol pathway even further, and to produce specific 
30 tocopherol isomers. In this way, the tocopherol 

biosynthetic pathway can be modified to enhance production 
of any tocopherol compound of interest, such as a- 
tocopherol. As noted above, a variety of sources are 
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available for the early tocopherol biosynthesis genes (and 
other tocopherol biosynthesis genes) , and a gene from any of 
these sources can be utilized. If co-suppression occurs 
when a plant gene native to the target host plant is used to 
5 increase expression of a particular enzyme, a coding 

sequence from another source can be used as an alternative. 

Preferred genes for introduction into plants to alter 
tocopherol quantity/quality include 3-deoxy-D-arabino- 
heptulosonate-7-P synthase (DAHP synthase) , shikimate 

10 kinase, either or both of the prephenate dehydrogenases, 1- 
deoxy-d-xylulose 5-phosphate synthetase (DXS) , 1-deoxy-d- 
xylulose 5-phosphate reductoisomerase (DXR) , 4- 
diphosphocytidyl-2C-methyl-d-erythritol synthase (YgbP) , 4- 
diphosphocytidyl-2C-methyl-d-e2rythritol kinase (YchB) , 2C- 

15 methyl -d-erythritol 2 , 4 -cyclodiphosphate synthase (YgbB) , 

the gene product of GcpE, LytB (Altincicek et al . , 2001, J". 
Bacterial., 183:2411-2416; Altincicek et al . , 2001, J. 
Immunol., 166:3655-3658; Campos et al . , 2001, FEES Lett., 
488:170-173) , geranylgeranylpyrophosphate synthase (GGPPS) , 

20 geranylgeranylpyrophosphate hydrogenase (GGH) , 

phytyl /prenyl transferase (PPT) , 4-hydroxy-phenylpyruvate 
dioxygenase (HPPD) , 2-methyl-6-phytylplastoquinol tocopherol 
methyl transferase I (MTI) , tocopherol cyclase, Y" tocopherol 
methyl transferase (GMT) a plant sir 173 6 gene (see Cyanobase 

25 http://www.kazusa.or.jp/cyanbase), a plant sir 173 7 gene (see Cyanobase 

http://www.kazusa.or.jp/cyanbase), an ATPT2 gene (Smith et al . , Plant J"., 
11:83-92, 1977), and an AANTl gene (Saint Guily et al . , 
Plant Physiol., 100:1069-1071, 1992). 

4-hydroxy-phenylpyruvate dioxygenase and 

30 geranylgeranylpyrophosphate hydrogenase will increase the 

homogentisate and phytol pools, respectively. Enzymes that 
control fluxes through pathways are well known to be 
regulated in higher organisms such as plants. Therefore, 4- 
hydroxyphenylpyruvate dioxygenase and 
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geranylgeranylpyrophosphate hydrogenase genes of microbial 
origin which are not subject to regulation in plants, or 
those from higher organisms (plants, algae, fungi, etc.) 
that are deregulated, are especially attractive in this 
5 regard. A non-limiting example is the microbial enzyme 4- 
amino-4 -deoxyprephenate dehydrogenase (TyrA from Erwinia 
harbicola.) which can replace prephenate aminotransferase, 
arogenate dehydrogenase and aminotransferase. 
Overexpression of enzymes such as 3-deoxy-arabino-r 

10 heptulosonate 7-P (DAHP) synthase, prephenate dehydrogenase, 
and shikimate kinase would lead to increases in the levels 
of homogentisate . DNA encoding any of the tocopherol 
biosynthetic enzymes discussed herein can be introduced 
alone or in various combinations to enhance tocopherol 

15 quantity and/or alter tocopherol quality. When introduction 
of multiple enzymes is desirable, preferred combinations 
include, but are not limited to, 4-hydroxyphenylpyruvate 
dioxygenase (HPPD) plus geranylgeranylpyro-phosphate 
hydrogenase (GGH) , geranylgeranylpyrophosphate synthase 

20 (GGPP synthase) plus geranylgeranylpyrophosphate hydrogenase 
(GGH) , 4 -amino -4 -deoxyprephenate dehydrogenase (TyrA) plus 
phytylprenyl transferase (PPT) , geranylgeranylpyro-phosphate 
hydrogenase (GGH) plus phytylprenyltransf erase (PPT) , 
geranylgeranylpyrophosphate synthase (GGPP synthase) plus 

25 phytylprenyltransf erase (PPT) , 2-methyl-6'-phytylplastoquinol 
tocopherol methyltransf erase I (MTI) plus 
phytylprenyltransf erase (PPT) , or 2 -methyl -6- 
phytylplastoquinol tocopherol methyltransf erase I (MTI) , 
phytylprenyltransf erase (PPT) , 4-hydroxyphenylpyruvate 

30 dioxygenase (HPPD) and geranylgeranylpyrophosphate synthase 
(GGPP synthase) . 

Plants characterized by increase levels of sterol and 
tocopherol production can be produced by transforming plant 
cells or tissues genetically altered for increased sterol 
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production by the methods described herein with additional 
nucleic acid constructs encoding tocopherol biosynthetic 
enzymes. Introduction of constructs encoding tocopherol 
pathway enzymes can be accomplished using standard methods 
5 in molecular biology such as those described herein or those 
described in PCT International Publication WO 00/61771. 
Introduction of constructs encoding 3-hydroxy-3- 
methylglutaryl -Coenzyme A reductase, at least one other 
sterol synthesis pathway enzyme, and at least one tocopherol 
10 synthesis pathway enzyme can be accomplished in a single 
transformation or in a series of transformations. For 
example, and without limitation, plant cells transformed 
with constructs encoding 3 -hydroxy-3-methylglutaryl -Coenzyme 
A reductase, at least one other sterol synthesis pathway 
15 enzyme as described herein could be selected and then 

further transformed with additional constructs encoding one 
or more tocopherol synthesis pathway enzymes and in 
particular S-adenosylmethionine- dependent y- tocopherol 
methyl transferase enzyme. Successfully transformed cells 
20 can then be selected and used to regenerate plants having 

increased levels of phytosterols and/or phytostanols as well 
as increased levels of tocopherol. Plants produced can then 
be ^'selfed", a technique well known in the art, to produce 
uniform populations of plants. 
25 Alternatively, plants characterized by increased levels 

of tocopherols and phytosterols and/or phytostanols can be 
produced by traditional plant breeding methods. For 
example, plants transformed with nucleic acid constructs 
encoding 3 -hydroxy-3 -methylglutaryl -Coenzyme A reductase and 
30 at least one other sterol synthesis pathway enzyme can be 
sexually crossed with high tocopherol plants. Any plant 
transformed to produce increased levels of tocopherols and 
in particular a-tocopherols can be used. Non-limiting 

.02061072A2_I_> 



wo 02/061072 



PCT/US02AM>255 



examples include plants produced by the methods described 
above and in PCT International publication WO 00/61771. 
If desired, the plants produced can be selfed to produce 
homozygous, uniform populations of plants. 
5 * -i^eed obtained from the transgenic, progeny, hybrid, 

etc., plants disclosed herein can be used in. methods for 

! 

obtaining oil containing phytosterols , phytosterol esters, 

\ 

• phytostanols, phytostanol esters, or mixtures, thereof along 

with tocopherols employing extraction and processing 

10 procedures known in the art. Note, in this regard, Kochhar 
(1983) Prog. Lipid Res. 22 1 161-188. Alternatively, seeds 
with increased levels of tocopherols and phytosterols, 
phytosterol esters, phytostanols, phytostanol esters, or 
mixtures thereof; or fruits and vegetables with increased 

15 levels of tocopherols and phytosterols, phytosterol esters, 
phytostanols, phytostanol esters, or mixtures thereof, can 
be used direct ly. 

Tocopherols and phytosterols and/or phytostanols can 
then be obtained from deodorized distillates of oil seed 

20 extracts and in particular soybean oil extracts. Such 

deodorized distillates are expected to contain increased 
levels of both tocopherols and phytosterols and/or 
phytostanol extracts. Oil extracts from plants and seed of 
the present invention are particularly valuable in that they 

25 allow the production of high sterol /tocopherol oils in a 

single process thus resulting in reduced purification costs, 
processing time and waste stream. Methods for the isolation 
of tocopherols and sterols from plant oils are well known in 
the art and can be found, for example, in U.S. Patent Nos . 

30 4,454,329; 5,097,012; 5,594,437; and 5,981,781. 



EXAMPLES 

The following examples are intended to provide 
illustrations of the application of the present invention. 
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The following examples are not intended to completely define 
or otherwise limit the scope of the invention. 

Example 1 ■ Enhancement of Phytosterol content in seeds 
of transgenic plants by seed-specific overexpression of 
5 full-length HMG-CoA reductase (HMGR) 

In order to examine the ability of HMGR overexpression 
for increasing sterol compound levels in seeds, the 
following experiment was performed in Glycine max, A full- 
length HMGR gene from rubber genomic DNA was expressed in 

10 developing Glycine max seeds using the 7S promoter. This was 
achieved by excising the rubber HMGR gene from the plasmid 
pHEVlS (Schaller et al . , (1995) Plant Physiol., 109: 761- 
770) using EcoRI . The 3.8 Kb fragment was inserted into the 
EcoRI site of pMON29920 (Fig. 3) such that the HMGR gene is 

15 flanked by the 7S promoter on the 5' end and the E9 3' 
terminator to create pMON43 80 0 (Fig. 4) . This was next 
digested with Sail and Notl to release a 7 . 7 Kb fragment 
that was then blunt -ended at the Sal I end before ligating 
to pMON23616 (Fig. 5) that was first cut with Smal and I^otl . 

20 This created the pMON43818 binary vector that contains the 
rubber HMGR gene driven by 78 promoter and the NPTII gene 
selection marker driven by the NOS promoter and 3' NOS 
terminator. PMON43818 (Fig. 6) was used to transform 
Agrobacterium tumefaciens and transfoarm Glycine max 

25 cotyledon explants as described in Example 2 . 

Seeds from 15 transgenic plants and one nontransgenic 
control plant were harvested at maturity. Sterol extraction 
and analysis on ten individual seeds per plant were 
performed as described in Example 2 . Results are presented 

30 in Table 2 . 
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Table 2: Sterol profile of transgenic soybean plants 
expressing rubber HMGR gene driven by 7s promoter. Event 
5 licontrol, events 2-16:15 transgenic plants generated by 15 
independent events using Agrrobacterium mediated 
transformation. 

Total sterols increased by 3.2- and 3.9- fold in the 
best performing plants (transgenic events 3 and 4). These 

10 two events also showed the highest increases of individual 
sterols. Campesterol increased by 2.7-fold, sitosterol by 
3. 4 -fold, sitostanol by 3. 2 -fold and other sterols by 6.5- 
fold in event 3 while stigmasterol increased by 2.3-fold in 
event 4. The other sterols, which account for the highest 

15 increase in total sterols were pathway intermediates that 
included squalene, cycloartenol , 2 4 -methylene cycloartenol , 
obtusifoliol, isofucosterol, and stigmasta-7-enol • These 
pathway intermediates normally form minor constituents in 
the sterol composition of seeds. However, in the transgenic 
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seeds, probably due to increased carbon flux, through the 
pathway, they accumulate in significant amounts. This 
suggests additional control points for sterol biosynthesis 
in plants such as squalene epoxidase, C-24 sterol 
5 methyl transferase, and C-14 obtusifoliol demethylase. 

Example 2 : Enhance phytosterol biosynthesis in seeds of 
transgenic soybean plants by seed-specific expression of 
catalytic domain of HMG-CoA Reductase (HMGR) alone and in 
10 combination with sterol methyl transferase II (SMTII) 

In another embodiment of the present invention, 
the levels of sterol compounds, including sitosterol, 
sitostanol, campesterol, stigmasterol and at least one ester 
for each of the sterol compounds and mixture there of, can 
15 be elevated in plant seeds by overexpression of catalytic 
domain of plant-HMG-CoA reductases. One can transform a 
plant of interest using an expression cassette or vector 
comprising DNA encoding a polypeptide exhibiting 3-hydroxy- 
3 -methylglutaryl -Coenzyme A reductase (HMG-CoA reductase or 
HMGR) activity. HMGR cDNAs from rubber have been 
successfully used to increase plant sterol levels in plant 
tissues (Schallet et al . (1995) Plant Physiol. 109: 761- 
770) . Full-length and truncated forms of HMGR CDNAs 
encoding full-length and catalytic domain of HMGR, 
respectively, from Arabidopsis have also been used to 
overproduce sterols in transgenic Arabidapsis plants 
(Gonzalez et al . (1997) Third Terpnet Meeting of the 
European Network on Plant Isoprenoids Abstracts, Abstract 
No. 33, page 33) . In the above examples however, the genes 
have not been specifically targeted to increase sterol 
levels in seeds. Another approach to enhance the 
nutritionally beneficial 24 -ethyl sterols (sitosterol, 
sitostanol) and reduce the accumulation of 24 -methyl sterols 
(campesterol) in seeds one can co-express two genes encoding 
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the enzymes HMGR and sterol methyl transferee II (SMTII) , 
each under the control of seed- specif ic promoter. Here we 
present evidence for such approaches: sterol composition of 
transgenic soybean seeds haboring truncated form (catalytic 
5 domain of HMGR without linker) of Arabidopsis HMGRl is 

presented in Figure 11 and Table 3. Sterol composition of 
transgenic soybean seeds haboring Arahldopsls HMGRl 
(catalytic domain of HMGR without linker) and Arabidopsis 
SMTII is presented in Figure 12 and Table 4. 

10 In order to examine whether overexpression of the 

catalytic domain of HMGR increases sterol levels in the 
seeds of transgenic soybean, the following experiment was 
performed in Glycine ma:K. A truncated form of HMGRl cDNA 
encoding only the catalytic domain of HMGR from Arabidopsis 

15 was expressed in developing seeds of Glycine max using the 
seed- specif ic IS promoter. This was achieved by excising 
the cDNA fragment (HMGRlcd) encoding the HMGRl catalytic 
domain from the plasmid pHMGRlcd (Dale et al . , (1995) Eur. 
J. Biochem. 233: 506-513) using Ndel and Smal enzymes 

20 resulting in the isoloation of a 1.9 Kb fragment. The Ndel 
overhang was filled-in and the 1.9 Kb fragment was blunt- 
end ligated to vector pMON43 818 (Figure 6) , previously Xhol 
(Xhol overhang was filled-in) and Smal digested such that 
the HMGRlcd was flanked by the 7S promoter on the 5' end and 

25 the E9 3' terminator to create a recombinant vector 

pMON43 052 (Figure 7) . This was next digested with Xbal and 
blunt -ended and then digested with Not I to release a 3.4 Kb 
fragment and ligated to pMON51850 (Figure 8) that was 
digeted with Smal and Notl. The ligation created a 

30 recombinant binary vector pMON43057 (Figure 9) that 

contained the cDNA fragment encoding the catalytic domain of 
Arabidopsis HMGRl, driven by 7S promoter and E9 3' 
terminator and the NPTII selectable marker gene driven by 
the NOS promoter and 3' NOS terminator. The pMON43 057 was 
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used for Agrobacterium tumefaclens mediated transformation 
of Glycine max cotyledon explants. The pMON43 058 (Figure 
10) construct carrying both the catalytic domain of 
Arahidopsis HMGRl and Arahidopsis SMTII , both driven by the 
5 7S promoter, was also used for AgrrojbacteriuiT? temefaci ens- 
mediated transformation of Glycine max in a similar manner 
descibed below. 

Explants for transformation were prepared as follows: 
sterilized seeds were germinated on germination medium under 

10 light at 28°C for 5-6 days. Germinated seeds were placed in 
the dark at 4°C for 24 hours prior to excision. Seed coats 
were removed and hypocotyls of each seedling trimmed to a 
length of 0.5 cm to 1.0 cm in length. The cotyledons were 
then split open such that the hypocotyl was split down in 

15 the middle. The primary leaves and apical region of each 
cotyledon was removed to expose the wounding region. 
Wounding was performed with 3-7 shallow, scalpel scores in 
line with the embryo axis, ensuring that the apical bud was 
damaged. Wounded explants were incubated in the culture of 

20 Agrobacterium tumefaciens containing pMON43057. Incubation 
was for 1 hour at room temperature- Innoculated explants 
were then transferred to a co- culture medium and placed 
under light at 23°C for 3-4 days. At this time explants were 
transferred to shooting medium without kanamycin selection 

25 and placed in a 25°C light growth room for 4 days. 

After 4 days on delay, explants were transferred to a 
186 ppm kanamycin selection medium and placed in a 25*^C light 
growth room for 2 weeks. At the end of two weeks explants 
were transferred to 186 ppm Woody Plant medium and placed 

30 again in a 25''C light growth room for another 2 weeks. 

Cultures were transferred every 2 weeks to fresh medium for 
approximately 18-21 weeks. At the 6 week transfer, the 
cotyledons and any dead material were removed from the 
explants, and the petiole was cut. At each s\xbsequent 2 
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week transfer, the petiole was cut to expose fresh cells to 
the medium. 

Transgenic shoots that were approximately in length, 
with 2 nodes, 1 open trifoliate and an active growing point 
5 were selected, cut and transferred to rooting medium. Once 
a good root system was developed, the plants, were sent to 
the greenhouse to grow up in soils in pots. 

Seeds from the 15 transgenic plants and one 
nontransgenic control plant were harvested at maturity. Ten 

10 individual seeds from each plant were weighed and ground 

into fine powder using an electric grinder. A known amount 
of cholestane (usually lOO^g in lOO^il ethanol) was added to 
each approximately 50mg powder sample. Sterol compounds 
were hydrolyzed directly from the ground tissue by 

15 saponification with 2ml of 10% KOH in methanol by refluxing 
the material at 60°C for 30 minutes. The refluxed samples 
were cooled to room temperature and filtered through glass 
wool. An equal volume of water was added to each filtrate, 
and the nonsaponif iables were extracted by partitioning 

20 three times with equal volumes of hexane . The hexane phases 
were pooled and evaporated. The residues were resuspended 
in 1 ml of acetone, and quantatively transferred to glass GC 
vials that were immediately capped. Sterols were analyzed 
by Gas Chromatography -Flame Ionizing Detector using the 

25 following conditions: Inlet temperature of 220*^C, detector 

temperature of 32 0^^0, and column oven temperature programmed 
from 22 0°C to 32 0®C with initial temperature for 1 minute and 
final temperature for 16 minutes and ramp rate of 8°/min. 
The column used was a glass capillary DB-5 column of 50 m 

30 length, 320 \xm diameter, and a film thickness of 0.25 |j.m. 
The carrier gas was helium at a flow rate of l.Oml/min. 
Results are presented in Table 3 and Table 4 . 
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To fully characterise the sterol compounds present in 
the transgenic seeds, a representative sample was also 
analyzed by Gas Chromatograpy-Mass Spectrometry (GC-MS) for 
confirmation of the sterol compounds present. The GC-MS 

5 conditions were as follows: inlet temp, 250°C, detector 
320°C, oven programmed from 180°C to 325''C with initial 
equilibration time of 1.0 min, ramping to 310°C at 4°/min at 
then 20°/min to 325^*0. The column was a DB-5 capillary glass 
column similar to the one used for GC-FID. 

0 Majority of the transgenic lines harboring pMON43057 

showed 3 to 5-fold increase in total sterols. The best 
performing transgenic lines, GM_A13342 and GM_A13634, showed 
6.5- and 6.1-fold increase in total sterols, respectively. 
These lines showed 2- to 2 .6- fold increase in sitosterol, 

5 1.5 to 2,2-fold increase in sitostanol and no significant 
change in the campesterol levels. Hence the major 
proportion of the total sterol increase was accounted by the 
accumulation of pathway intermediates which include 
squalene, cycloartenol , 24 -methylene cycloartenol , 

0 obtusif oliol, isofucosterol, and stigmasta-7-enol . The best 
performing transgenic lines, GM_A13342 and GM_A13634, showed 
32.6- and 32-2-fold increase in pathway intermediates 
accumulation, respectively, as compared to the control. In 
all the transgenic lines harboring the pMON43 057, 50-70% of 

5 the total increase was accounted by the increase in the 
pathway intermediates accumulation as compared to the 
control. The pathway intermediates include squalene, 
cycloartenol , 24 -methylene cycloartenol , obtusif oliol , 
isofucosterol, and stigmasta-7-enol . 

) Six transgenic lines haboring pMON43 058 produced 5.8- 

to 6-fold increase in total sterols and the rest of the 10 
transgenic lines with the pMON43058 showed 3- to 5-fold 
increase in total sterols. The best performing transgenic 
lines showed about 2- to 3 -fold increase in sitosterol and 
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4.5- to 6-fold increase in sitostanol levels. However, the 
campesterol accumulation was reduced by 50% in these lines. 
This was due to overexpression of the Ar^bidopsis SMTII 
enzyme which enhances the carbon flux towards the synthesis 
5 of 24 -ethyl sterols thereby reducing the carbon flux through 
the pathway leading to the synthesis of 24-mpthyl sterols. 
• As seen in pMON43 057 transgenic lines, all of the transgenic 

\ lines harboring the pMON43 058 also accumulated 50-60% of the 

total sterols in the form of pathway intermediates which are 

10 squalene, cycloartenol , 24 -methylene cycloartenol , 

obtusif oliol, isofucosterol , and stigmasta-7-enol . These 
pathway intermediates normally form minor constituents in 
the sterol composition of seeds. However, in the transgenic 
seeds, probably due to increased carbon flux through the 

15 pathway, they accumulate in significant amounts. The 

pathway intermediates accumulation is highly significant 
when the truncated from of HMGR is overexpressed as compared 
to the full length form of HMGR suggesting that the 
overexpression of the truncated form of HMGR creates even 

20 greater increase in carbon flux through the pathway. This 
provides further evidence for additional control points for 
sterol biosynthesis in plants such as squalene epoxidase, 
sterol methyltransf erase I, sterol C4-demethylase, 
obtusifoliol C14a-demethylase, sterol C5-desaturase , and 

25 sterol methyl transferase II. 



Examr)le 3 . Enhancement of phytosterol biosynthesis in 
seeds of Arahidopsis transgenic plants by constitutive 
30 expression of different forms of Arahidopsis and rubber HMGR 
enzymes . 

Arahidopsis transgenic plants were generated using 
Agrohacterivm mediated transformation of constructs 
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(pM0N53733, pMON53734, pMON53735, pMON53736, pM0N53737, 
pMON53738, pM0N53739, pMON53740) carrying cDNA encoding 
different forms of Arabidopsis and rubber HMGR enzymes 
driven by CaMV enhanced 35S promoter (Figures 13-20) . The 
5 transformed Arabidopsis seeds carrying each of the above 
constructs were selected on kanamycin (5 0jxg/ml) medium to 
select for transf ormants expressing the selectable marker, 
the NPTII gene driven by CaMV 35S promoter. Kanamycin 
resistant Arabidopsis transgenic plants were grown in green 
10 house for maturity and seeds were collected from each of the 
transgenic lines for sterol analysis. About 50 mg of seeds 
from each transgenic line were weighed, homogenized and used 
for saponification to extract sterols as described in 
Example 2 . 

15 

Figures 21-26 describe the sterol analysis data 
obtained from the transgenic lines carrying each of the 
above constructs. Figure 27 shows the effect on different 
sterol end products and pathway intermediate accumulation 

20 when different foirms of rubber HMGR cDNAs were expressed 
constitutively in transgenic Arabidopsis plants. When 
truncated rubber HMGR (with or without linker region) was 
overexpressed the total sterol accumulation in seeds 
increased by 2.9 to 3. 7- fold as compared with the wild type 

25 control plants. The sterol end products such as campesterol 
and sitosterol showed 1.5 to 2 -fold increase in the lines 
expressing truncated form of rubber HMGR (with and without 
linker) . However the sitostanol end product accumulation in 
the transgenic lines harboring the toruncated form of rubber 

30 HMGR (with and without linker) was enhanced by 2 . 8 to 7- 
fold. There is a significant accumulation of pathway 
intermediates such as cycloartenol and 24 -methylene 
cycloartenol in the seeds of the transgenic lines 
transformed with the truncated form of rubber HMGR (with and 
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without linker region) . The wild type control plants used 
in the experiment do not accumulate both of the pathway 
intermediates . 

5 Example 4. Comparison of Steroid Compounds from HMGR 

Constructs in a Yeast HMGRl Knockout Mutant . 

The effects on the sterol levels of the expression of 
various HMGR constructs expressed in a yeast HMGRl knockout 
mutant were compared. Constructs containing a nucleic acid 

10 encoding the full length HMGR polypeptides from Arabidopsis 
and rubber were compared to those encoding a truncated 
Arabidopsis or rubber HMGR polypeptide that were lacking 
both the membrane binding and linker region of HMGR. The 
control yeast cells were transformed with a similar 

15 construct lacking a polypeptide encoding any form of HMGR. 
Yeast cells transformed with AraJbidopsis HMGR and 
rubber HMGR constructs accumulated approximately the same 
amounts of zymosterol and ergosterol, but more squalene than 
the control yeast . 

20 Transformed yeast cells having rubber HMGR constructs 

accumulated about the same amount of ergosterol, but about 
twice as much squalene and zymosterol than the control 
yeast . 

Transformed yeast cells having Arabidopsis tHMGR 
25 constructs accumulated three times as much squalene, twice 
as much zymosterol, and about 30 percent more ergosterol 
than the control yeast . 

Transformed yeast cells having rubber tHMGR constructs 
accumulated three times as much squalene, four times as much 
30 zymosterol, and about 50 percent more ergosterol than the 
control yeast . 

The data are shown in a Figure 28, "Plant HMGRl 
Contructs in Yeast HMGRl Knockout Mutant". 
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Example 5 , Gene sequences for all genes listed in the 
application 

The sequences obtained from the NCBI public database 
are SEQ ID NO.: 1,2,3,20,21,22,23. These sequences are 
5 included in the appendix and denoted as follows: 
Appendix A= SEQ ID NO, 1, 
Appendix B= SEQ ID NO. 2, 
Appendix C= SEQ ID NO. 3, 
Appendix D= SEQ ID NO. 20, 
10 Appendix E= SEQ ID NO. 21, 

Appendix F= SEQ ID NO. 22, 
Appendix G= SEQ ID NO. 23. 

SEQ ID 1 = Arabidopsis squalene epoxidase protein 
15 sequence (Accession NO: AC004786) See Appendix A 

SEQ ID 2 = Arabidopsis squalene epoxidase (Accession 
NO: N64 916) See Appendix B 

SEQ ID 3 = Arabidopsis squalene epoxidase (Accession 
NO: T44 667) See Appendix C 

20 

SEQ ID 4 = Arabidopsis squalene epoxidase (clone ID: 
ATA5062 63) nucleotide sequence 

GAATTCCCGGGTCGACCCACGCGTCCGCTTATAGATAAGGATATGGCCTTTACGA 

25 ACGTTTGCCTATGGACGCTACTCGCCTTCATGCTGACTTGGACAGTGTTCTACGTCACAA 
ACAGGGGGAAGAAGGCGACGCAGTTGGCGGATGCGGTGGTTGAAGAGCGAGAAGACGGTG 
CTACTGACGTTATCATCGTTGGGGCTGGAGTAGGCGGCTCGGCTCTCGCATATGCTCTTG 
CTAAGGACGGGCGTCGAGTCCATGTAATAGAGAGGGACCTGAGAGAACCAGAGAGAATCA 
TGGGTGAGTTTATGCAACCAGGAGGACGACTCATGCTCTCTAAGCTTGGTCTTGAAGATT 

30 GTTTGGAGGGAATAGATGCCCAAAAAGCCACGGGCATGACAGTTTATAAGGACGGAAAAG 
AAGCAGTCGCATCTTTTCCCGTGGACAACAACAATTTTCCTTTTGATCCTTCGGCTCGAT 
CTTTTCACAATGGCCGATTCGTCO^CGATTGCGGCAAAAGGCTTCTTCTCTTCCC^ 
TGCGCCTGGAAGAAGGAACGGTGAAGTCTTTGATAGAAGAAAAAGGAGTGATCAAAGGAG 
TGACATACAAAAATAGCGCAGGCGAAGAAACAACAGCCTTGGCACCTCTC^ 

35 GCGACGGTTGCTACTCAAACCTTCGCCX3GTCTCTTAATGACAACAATGCGGAGGTTCTGT 
CATACCAAGTTGGTTTTATCTCAAAGAACTGTCAGCTTGAAGAACCCGAAAAGTTAAAGT 
TGATAATGTCTAAACCCTCCTTCACCATGTTGTATCAAATCAGCAGCACCGACGTTCGTT 
GTGTTTTTGAAGTTCTCCCCAACAACATTCCTTCTATTTCAAATGGTGAAATGGCTACTT 
TCGTGAAGAACACTATTGCTCCTCAGGTACCTTTAAAACTCCGCAAAATATTTTTGAAAG 

40 GGATTGATGAAGGAGAACATATAT^AAGCCATGCCAACAAAGAAGATGACAGCTACTTTGA 
GCGAGAAGAT^GGAGTGATTTTATTGGGAGATGCATTCAACATGCGTCATCCAGCAATCG 
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CATCTGGAATGATGGTTTTATTATCTGACATTCTCATTTTACGCCGTCTTCTCCAGCCAT 
TAAGCAACCTTGGCAATGCGCAAAAAATCTCACAAGTTATCAAGTCCTTTTATGATATCC 
GCAAGCCAATGTCAGCGACAGTTAACACGTTAGGAAATGCATTCTCTCAAGTGCTAGTTG 
CATCGACGGACGAAGCAAAAGAGGCAATGAGACAAGGTTGCTATGATTACGTCTCTAGTG 
5 GTGGGTTTCGCACGTCAGGGATGATGGCTTTGCTAGGCGGCATGAACCCTCGTCCGATCT 
CTCTCATCTATCATCTATGTGCTATCACTCTATCCTCCATTGGCCATCTACTCTCTCCAT 
TTCCCTCTCCCCTTGGCATTTGGCATAGCCTTCGACTTTTTGGTTTGGCTATGAAAATGT 
TGGTTCCCCATCTCAAGGCTGAAGGAGTTAGCCAAATGTTGTTTCCAGTCAACGCCGCCG 
CGTATAGCAAAAGCTATATGGCTGCAACGGCTCTTTAAAACACTGGTGCTTTAAACTGCA 
10 AAATATAACACATATATAAATCCCGAATCTTTGTGATTCTGCATATATTGTGTTCTACAA 
TTATTCTCATATAAATGAAAATTGTTCTACGTAAAAGTAAAAAGAAGGAATTGTAATACT 
AATAAAACGAGTTTTTAATTCTGTTGAATGCTTGTGTATATTGGTGAAAAT^AAAAAAAAA 
AAAAAAAAAAAAA2\AAAAAAAAAAGGGCGGCCGC 

15 SEQ ID 5 = Arahidopsis squalene epoxidase (clone ID: 

ATA5062 63) amino acid translation 

EFPGRPTRPLIDKDMAFT]SrVCLWTLIiAFMLTWTVFWTNRGKKATQLADAW 
EDGATDVIIVGAGVGGSALAYALAKDGRRVHVIERDLREPERIMGEFMQPGGRLMLSKLG 

20 LEDCLEG I DAQKATGMTVYKDGKEAVAS F P VDNNNFP FD P S ARS FHNGRFVQRLRQKAS S 
LPNVRLEEGTVKSLIEEKGVIKGVTYKNSAGEETTALAPLTWCDGCYSNLRRSLNDm 
EVLSYQVGFISKNCQLEEPEKLKIjIMSKPSFTMLYQISSTDVRCVFEVLPNNIPSISNGE 
MATFVKNTIAPQVPLKLRKIFIiKGIDEGEHIKAMPTKKMTATLSEKKGVIIjIjGDAFNI^ 
PAIASGMIWLLSDILILRRLLiQPLSNLGNAQKISQVIKSFYDIRKPMSATVNTLGNAFSQ 

25 VLVASTDEAKEAMRQGCYDYLSSGGFRTSGMMALLGGMNPRPISLIYHLCAITLSSIGHL 
LSPFPSPLGIWHSLRLFGLAMKMLVPHLKAEGVSQMLFPVNAAAYSKSYI^^^ 

SEQ ID 6 = Arahidopsis squalene epoxidase (clone ID: 
ATA304243) nucleotide sequence 

30 

GAATTCCCGGGTCGACCCACGCGTCCGCGGACGCGTGGGATTGAGAACAAATAGA 
TTTGGTTATATATGGCTTTTACGCACGTTTGTTTATGGACGTTAGTCGCCTTCGTGCTGA 
CGTGGACGGTGTTCTACCTTACCAACATGAAGAAGAAGGCGACGGATTTGGCTGATACGG 
TGGCTGAGGATCAAAAAGACGGTGCTGCTGACGTCATTATCGTCGGGGCTGGTGTAGGTG 
35 GTTCGGCTCTCGCATATGCTCTTGCTAAGGATGGGCGTCGAGTACATGTGATCGAGAGGG 
ACATGAGAGAACCAGAAAGAATGATGGGTGAGTTTATGCAACCTGGCGGACGACTCATGC 
TTTCTAAACTTGGCCTTCAAGATTGCTTGGAAGACATAGATGCACAGAAAGCCACGGGTT 

tggcagtttataaagatggai\aagaagcagacgcaccttttccagtggataacaacaatt 
tttcttatgaaccttctgctcgatcttttcacaatggccgattcgtccaacaactgcgtc 

40 gaaaggctttttctctttccaatgtgcgcctggaagaaggaacggtgaagtctttactag 
aagaaaaaggagtggtcaaaggagtgacatacaagaataaagaaggcgaagaaacaacag 
ccttggcacctctcactgtggtatgcgacggttgctactcaaaccttcgtcggtctctta 
atgatgacaacaatgctgagattatgtcgtacatagttggttacatctcaaagaattgtc 
ggcttgaagaacccgaaaagctacacttgatattgtctaaaccatctttcaccatggtat 

45 accaaataagcagcactgacgttcgttgtggttttgaggttctccccgaaaattttcctt 
ctattgcaaatggtgaaatgtctactttcatgaagaatactatagttcctcaggtacctc 
caaaactccgcaaaatatttttgaaaggtatagatgagggagcacacataaaagtggtgc 
cggcaaagcgcatgacatctactttaagcaagaagaaaggtgtgattgtattgggagatg 
cattcaatatgcgtcatccagttgttgcatctggaatgatggttttactgtcggacattc 

50 tcattctacgccgtcttcttcagccattaagcaacctcggcgatgcaaacaaagtctcag 
aagttatcaattccttttatgatatccgcaagccaatgtcggcgacggttaacacattgg 
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GAAATGCyVTTTTCTCT^GTACTAATTGGATCAACGGATGAAGCAAAAGAGGCAATGAG 
AGGGTGTCTATGATTACCTTTGTAGTGGCGGGTTTCGTACGTCAGGGATGATGGCTCTGC 
TCGGCGGCATGAATCCTCGTCCTCTCTCTCTCGTCTATCATCTTTGTGCCATCACTCTAT 
CCTCCATTGGCCAACTGCTCTCTCCATTTCCCTCTCCCCTTCGCATTTGGCATAGCCTCA 
5 AGCTTTTTGGTTTGGCCATGAAAATGTTGGTTCCCAATCTCAAAGCTGAAGGAGTTAGCC 
AAATGTTGTTTCCAGCAAATGCAGCCGCGTATCACAAAAGCTATATGGCTGCAACCACTC 
TCTAAACTTTGATGCTCTCAATCGCAATATATATGGAGCACGAATCTATGTGATTGTGCA 
TTTGGTAAACGTGTATTGCAGTGCTTATAATTATTAGTATGTAACGGGGAAAAGTTCTAA 
ACACAAAAAAATAAACTTTTGAATGTTATATGTGTGAATTATTTTTGTTGTTACAAGTAA 
10 TGCTCTTTTTTTTTAGCTTCACACATGTATTATTGGAGCTAATTTTTTGTTTCTCTGTTC 
TTTTATTTTTGTTTTCTTACTGTATTTACTTTGAAT^GTTTCGTTTTATACATATTGGAC 
ATTTTTTAAAAAAAAAAAAAAAAAAAA7^AAAAAAA7U\AAAAAAAAAAAAAAAAJ^^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAi^AAAAAGGGC^ 
CGC 

15 

SEQ ID 7 = Arahidopsis squalene epoxidase (clone ID: 
ATA3 04243) amino acid translation 

MAFTHVCLWTLVAFVLTWTVFYLTNMKKKATDI^ 
20 AYAIAKDGRRVHVIERDMREPERMMGEFMQPGGRLMLSKIjGLQDCLEDIDAQKATG^ 
KI)GKEADAPFPVDNNNFSYEPSARSFHNGRFVQQLRRKAFSLSNWIjEEGTVKSL^^ 
WKGVTYKNKEGEETTAIAPIiTWCDGCYSNLRRSIiNDDm 

PEKLHLILSKPSFTMVYQISSTDWCGFEVLPENFPSIANGEMSTFMKNTIVPQVPPKLR 
KIFLKGIDEGAHIKWPAKRMTSTLSKKKGVIVLGDAFNMRHPW^ 
25 RLLQPLSNLGDANKVSEVINSFYDIRKPMSATVlSrrLGNAPSQVLIGSTDEAK^^ 

DYLCSGGFRTSGMMALLGGI^PRPLSLVYHLCAITLSSIGQLLSPFPSPLRIWHSLKLFG 
liAMKMLVPNLKAEGVSQ MLFPANAAAYHKSYMAATTL* 

SEQ ID 8 = Arahidopsis squalene epoxidase (clone ID: 
30 ATA102071) nucleotide sequence 

AAATCATATTGAGAACAAATAGATTTGGTTATATATGGCTTTTACGCACGTTTGT 
TTATGGACGTTAGTCGCCTTCGTGCTGACGTGGACGGTGTTCTACCTTACCAACATGAAG 
AAGAAGGCGACGGATTTGGCTGATACGGTGGCTGAGGATCAAAAAGACGGTGCTGCTGAC 
35 GTCATTATCGTCGGGGCTGGTGTAGGTGGTTCGGCTCTCGCATATGCTCTGCTAAGTGTG 
CGCCTGGAAGAAGGAACGGTGAAGTCTTTACTAGAAGAAAAAGGAGTGGTCAAAGGAGTG 
ACATACAAGAATAAAGAATGCGAACAAACAACAGCCTTGGCACCTCTCACTGTGGTATGC 
GACGGTTGCTAATCAAACCTTCGTCGGTCTCTTAATG 

40 SEQ ID 9 = Arahidopsis squalene epoxidase (clone ID: 

ATA102071) amino acid translation 

MAFTHVCLWTLVAFVLTWTVFYLTNMKKKATDL^ 
GGSAIiAYALLSWLEEGTVKSLLEEKGVVKGVTYKNKECEQTTAIiAPLTW 

45 

SEQ ID 10 = Arahidopsis squalene epoxidase (clone ID: 
ATA504158) nucleotide sequence 

50 CACAAAGCT^AAAAAATCTCTGTAAAAGO^GAACGATAATGGAGTCAC^ 

AATTGGATCTTACCTCTTTTGATCTCTTCTCTCCTCATCTCCTTCGTCGCTTTCTATGGA 
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TTCTTCGTCAAACCGAAGCGGAACGGTCTCCGTCACGATCGGAAAACTGTTTCTACCGTC 
ACCTCCGACGTCGGATCTGTTAATATTACCGGAGATACTGTCGCTGATGTCATTGTTGTT 
GGAGCTGGTGTTGCTGGTTCTGCTCTTGCTTATACTCTTGGAAAGGGGAAATTTAAACGC 
CGAGTTCATGTGATTGAAAGAGATTTATCGGAGCCTGATCGTATTGTTGGGGAGTTGTTA 
5 CAGCCTGNGGGTTACCTCAAGTTACTGGAGTGTGGAATTGGAGATTGTGTGGAAGAAATA 
GATGCTCAGCNTGTGTATGGTTATGCACTTTTTAAAAATGGG 



SEQ ID 11 = Arabidopsis squalene epoxidase (clone ID: 
ATA504158) amino acid translation 

10 TKQKNLCKSRTIMESQLWNWILPLLISSLLISFVAFYGFFVKPKRNGLRHDRKTV 
STVTSDVGSVNITGDTVADVIWGAGVAGSAIiAYTLGKGKFKRRVHVIERDLSEPDRIVG 
EliLQPXGYIiKLLECGIGDCVEEIDAQXVYGYALFKNG 



15 SEQ ID 12 = Arahidopsis obtusifoliol C14a-demethYlase 

nucleotide sequence (Accession NO: complement, join 
AC002329 : 37461...38456, AGO 023 2 9 : 3 912 1...3 954 6 ) (homolog of 
sorghum obtusifoliol C14a-demethylase) nucleotide sequence 

20 CGTGTTTTACAAATTTCCTTTGTTGGTTTTCCACAGATTTAAAGAACCCTAACGA 
GAGAAAAAAATGGACTGGGATTACTATACGCTGTTGAAGACGAGTGTGGCTATTATTATA 
GTGTTTGTTGTGGCCAAACTCATAACCTCCTCCAAATCCAAGAAGAAAACAAGTGTCGTC 
CCACTCCCTCCAGTTCTTCAAGCGTGGCCTCCATTTATCGGATCCCTAATCCGCTTCATG 
AAAGGTCCAATAGTGCTACTTAGAGAGGAATATCCTAAGCTTGGAAGTGTTTTCACAGTG 

25 AAGCTTCTTCACAAAAACATCACTTTTCTCATCGGTCCCGAAGTCTCGTCCCACTTTTTC 
AACGCTTATGAATCTGAACTCAGCCAGAAAGAAATTTACAAATTTAATGTGCCTACTTTT 
GGCCCCGGAGTTGTGTTTGATGTTGACTATCCCGTTCGGATGGAGCAGTTCCGATTCTTC 
TCCAGCGCTCTCAAGGATTACTTCTCAAAATGGGGAGAAAGTGGGGAAGTGGATCTAAAG 
GCCGAGTTAGAGCGTCTAATCACCTTGACTGCTAGTAGATGTCTATTGGGTCGAGAAGTC 

30 CGTGACCAACTTTTTGATGATGTTGCTCCATTGTTCCATGACCTTGATAAAGGCATGCAA 
CCCATAAGTGTCATCTTCCCAAAGCTCCCCATTCCAGCTCACAATTGTCGTGACCGTGCT 
CGCGGAAAGATTGCAAAAATCTTTTCAAACATCATAGCAACAAGAAAACGCTCTGGTGAC 
AAATCAGAGAACGACATGCTACAATGTTTCATCGACTCAT^GTACAAAGACGGTAGAGAG 
ACAACTGAATCTGAAGTAACTGGTTTGCTCATTGCTGGTTTGTTTGCAGGACAACATACA 

35 AGCTCTATCACTGCCACATGGACCGGTGCTTATCTAATTCAAAACAAACACTGGTGGTCC 
GCGGCTTTGGACGAGCAGAAGAAACTGATTGGAAAACATGGGGACAAGATCGACTACGAT 
GTTTTGTCTGAGATGGATTTTCTGTTTCGCAGTGCAAAAGAAGCTTTAAGGCTTCACCCT 
CCAAAGATCTTACTGCTGAGAACAGTACACAGTGATTTCACCGTGACAACTCGAGAAGGA 
AAGCAATATGAGATACCAAAGGGTCATATCGTTGCAACTTCTCCTGCATTCGCCAACCGC 

40 TTACCTCATGTCTACAAAGATCCGGAAAATTTTGATCCGGATAGATTTTCAAAGGAAAGA 
GAAGAGGATAAAGCAGCTGGTTCGTGTTCATACATCTCTTTGGGAGCTGGTAGGCACGAG 
TGTCCTGGTGGATCATTTGCGTTCTTGCAGATCAAAGCCGTATGGTGTCACTTATTGAGA 
AACTTTGAGCTTGAGTTAGTGTCACCTTTCCCTGAAATCAACTGGAATGCTTTGGTCGTT 
GGTGCTAAAGGAAATGTCATGGTTCGTTACAAGCGTCGTCCCTTTTCTTAA 

45 

SEQ ID 13 = AraJbidopsis obtusifoliol C14a-demethylase 
nucleotide sequence (Accession NO: complement, join 
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AC002329:37461,..38456, AC002329 : 39121...39546) (homolog of 
sorghum obtusifoliol C14a-demethylase) amino acid 
t r ans 1 a t i on 

5 MDWDYYTLLKTSVAI 1 1 VFWAKLITSSKSKKKTSWPLPPVLQAWPPFIGSLIR 

FMKGPIVLLREEYPKLGSVFTVKLLHKNITFLIGPEVSSHFFNAYESELSQKEIYKFWP 
TFGPGVWDVDYPVRMEQFRFFS S ALKDYFS KWGESGEVDLKAELERLI TLT^ 
EVRDQLFDDVAPLFHDLDKGMQP I S VI FPKLPI PAHNCRDRARGKI AKI FSNI I ATRKRS 
GDKS ENDMLQC F I DS KYKDGRETTE SEVTGLLI AGLFAGQHT SSI TATWTGAYL I QNKHW 
10 WSAALDEQKKLIGKHGDKIDYDVLSEMDFLFRSAKEALRLHPPKILLLRTVHSDFTVTTR 
EGKQYEIPKGHIVATSPAFAJSTRIjPHVYKDPENFDPDRFSKEREEDKAAGSCSYISIi^ 
HECPGGS FAFLQI KAWCHLLRNFELELVSPFPE INWNALWGAKGIWMVRYKRRPFS * 

SEQ ID 14 = Arabldopsis obtusifoliol C14a- 
15 demethylase (clone ID: ATA101105) nucleotide sequence 

GACACTATAGAAGAGCTATGACGTCGCATGCACGCGTACGTAAGCTCGGAATTCG 
GCTCGAGCTTGTTCACAAAAAGATTACTTTTCTTATTGGTCCTGAAGTCTCTGCTCATTT 
TTTCAAAGCTTCTGAATCTGATCTTAGTCAGGAGGAAGTGTATCAGTTCAATGTCCCTAC 

20 TTTTGGTCCTGGAGTTGTTTTCGATGTTGATTATTCTGTTTCGTCAGGAGCAGTTCGGTT 
CTTCACTGAGGCACTTAGAGTTAACAAGTTGAAGGGTTATGTGGATATGATGGTTACTGA 
AGCTGAGGATTACTTCTCTAAATGGGGAGAGAGTGGTGAAGTTGATATTAAGGTTGAGCT 
AGAGAGGCTCATCATCTTGACTGCAAGTGATGTTTACTGGGTCGAGAAGTTCGTGATCAG 
CTTTTTGATGATGTCTCTGCTTTGTTCCATGACCTTGACAATGGAATGCTTCCCATCAGT 

25 GCTTCCCATCAGTGTTCTCTTCCCATATCTCCCAATTCCAGCTCACCG 

SEQ ID 15 = Arahidopsis obtusifoliol C14a- 
demethylase (clone ID: ATA101105) amino acid translation 

30 HYRRAMTSHARVRKLGIRLELVHKKITFLIGPEVSAHFFKASESDLSQQEVYQFN 
VPTFGPGWFDVDYSVRQEQFGSSLRHLELTS 

SEQ ID 16 = Arstbldopsis obtusifoliol C14a- 
demethylase (clone ID: ATA202967) nucleotide sequence 

35 

TCGACCCCGCGTCCGCGGACGCGTGGGATCAGCTTCAAGCTTAAGAGAGCTTCGA 
AAGCGAAAGCGACGATTTCTTCTCCATCGTGAGAGCAAATCTCCAGAGCCGTTTTCTCTT 
CTTCTTCTTCCTCCTCGCGCCGTCTCTGAAACTCCATCATCGTATCAATCAAATTGCTTC 
CTCCTCCAAATTGAAAAACAATGGAATTGGATTCGGAGAACAAATTGTTGAAGACGGGTT 

40 TGGTTATAGTGGCGACACTTGTTATAGCCAAACTCATCTTCTCTTTCTTCACTTCTGATT 
CTAAGAAGAAGCGTCTTCCTCCTACTCTTAAAGCTTGGCCTCCATTGGTTGGAAGTCTTA 
TCAAATTCTTGAAAGGACCTATTATTATGCTTAGAGAGGAATACCCTAAGCTTGGAAGTG 
TGTTTACTGTTAATCTTGTTCACAAAAAGATTACTTTTCTTATTGGTCCTGAAGTCTCTG 
CTCATTTTTTCAAAGCTTCTGAATCTGATCTTAGTCAGCAGGAAGTGTATCAGTTCAATG 

45 TCCCTACTTTTGGTCCTGGAGTTGTTTTCGATGTTGATTATTCTGTTCGTCAGGAGCAGT 
TTCGGTTCTTCACTGAGGCACTTAGAGTTAACAAGTTGAAGGGTTATGTGGATATGATGG 
TTACTGAAGCTGAGGATTACTTCTCTAAATGGGGAGAGAGTGGTGAAGTTGATATTAAGG 
TTGAGCTAGAGAGGCTCATCATCTTGACTGCAAGTAGATGTTTACTGGGTCGAGAAGTTC 
GTGATCAGCTTTTTGATGATGTCTCTGCTTTGTTCCATGACCTTGACAATGGAATGCTTC 

50 CCATCAGTGTTCTCTTCCCATATCTCCCAATTCCAGCTCACCGCCGTCGTGACCGTGCCC 
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GAGAAAAGCTTTCGGAGATTTTCGCAAAAATCATTGGGTCGAGAAAACGCTCTGGAAAAA 
CAGAGAACGACATGCTGCAGTGTTTCATCGAATCAAAGTACAAAGATGGTAGACAGACAA 
CCGAATCTGAAGTCACTGGTTTGCTCATTGCTGCTCTGTtTGCAGGACAACACACGAGCT 
CTATCACTTCCACCTGGACCGGTGCTTATCTGATGCGATACAAAGAGTACTTCTCAGCTG 
5 CTCTTGATGAGCAGAAGAACCTGATTGCGAAACATGGAGACAAGATCGATCATGATATCT 
TATCCGAGATGGATGTTCTCTACCGCTGCATTAAGGAAGCGTTGAGGCTTCACCCTCCAC 
TCATCATGTTAATGAGAGCCTCGCACAGTGATTTCAGCGTGACAGCTCGGGATGGAAAAA 
CTTACGATATCCCAAAGGGTCACATCGTTGO^CCTCCCCTGCATTTGCCAACCGCTTAC 
CGCACATCTTCAAAGACCCCGACACCTACGACCCAGAAAGATTCTCCCCTGGAAGAGAAG 

10 AGGACAAAGCCGCAGGGGCATTCTCGTACATTGCATTCGGAGGGGGAAGGCACGGGTGCC 
TTGGAGAGCCGTTTGCTTACCTGCAGATCAAAGCCATATGGAGTCATTTGTTGAGGAACT 
TCGAGCTTGAGCTAGTTTCACCGTTCCCTGAGATTGACTGGAACGGTATGGTGGTTGGAG 
TTAAAGGCAATGTGATGGTGCGTTACAAGAGGCgcCAGCTTTCTTAAAGACAAGTTTAAG 
GTTATTGCAGCTTTGGATTTTTCTCTCTGGTTTCTGCTTTGCTTTTGTCCCTCTCTGGTT 

15 TTAGTTTTGTTGTTGAATAATTCTTCTGTTTTTATAAACTGTTGTTACTCTTTAATTGAC 
ATTTATTTTTAAGCTTCCTAAGTTTGTGGTTCAAAAAAAAAAAAAGGCGGCGTTACT 

SEQ ID 17 = Arahidopsis obtusifoliol C14a- 
demethylase (clone ID: ATA202967) amino acid translation 

20 

MELDSENKLLKTGLVIVATLVIAKLIFSFFTSDSKKKRLPPTLKAWPPLVGSLIKFLKGP 
IIMLREEYPKLGSVFTVNLVHKKITFLIGPEVSAHFFKASESDLiSQQEVYQFNVPTFGPG 
WFDVDYSVRQEQFRFFTEALRWKLKGYVDlVnWTEAEDyFSKWGESGEVDIK^ 
ILTASRCLLGREVRDQIjFDDVSAIjFHDLDNGMLPISVLFPYLPIPAHRRRDRAREKLSEI 
25 FAKI I GSRKRS GKTENDMLQCF I E S KY KDGRQTTE S E VTGLL I AALFAGQHTS SITS TWT 
GAYLMRYKEYFSAALDEQKNLIAKHGDKIDHDILSEMDVLYRCIKEALRLHPPLIMLIV^ 
SHSDFSVTARDGKTYDIPKGHIVATSPAFANRLPHIFKDPDTYDPERFSPGREEDKAAGA 
FSYIAFGGGRHGCLGEPFAYLQIKAIWSHLLRNFELELVSPFPEID1)^A1WVGVKGNV^^ 
RYKRRQLS* 

30 

SEQ ID 18 = Arabidopsis obtusifoliol C14a- 
demethylase (clone ID: ATA403931) nucleotide sequence 

TCGACCCCGCGTCCGCGGACGCGTGGGATCAGCTTCAAGCTTAAGAGAGCTTCGA 

35 AAGCGAAAGCGACGATTTCTTCTCCATCGTGAGAGCAAATCTCCAGAGCCGTTTTCTCTT 
CTTCTTCTTCCTCCTCGCGCCGTCTCTGAAACTCCATCATCGTATCAATCAAATTGCTTC 
CTCCTCCAAATTGAAAAACAATGGAATTGGATTCGGAGAACAAATTGTTGAAGACGGGTT 
TGGTTATAGTGGCGACACTTGTTATAGCCAAACTCATCTTCTCTTTCTTCACTTCTGATT 
CTAAGAAGAAGCGTCTTCCTCCTACTCTTAAAGCTTGGCCTCCATTGGTTGGAAGTCTTA 

40 TCAAATTCTTGAAAGGACCTATTATTATGCTTAGAGAGGAATACCCTAAGCTTGGAAGTG 
TGTTTACTGTTAATCTTGTTCACAAAAAGATTACTTTTCTTATTGGTCCTGAAGTCTCTG 
CTCATTTTTTCAAAGCTTCTGAATCTGATCTTAGTCAGCAGGAAGTGTATCAGTTCAATG 
TCCCTACTTTTGGTCCTGGAGTTGTTTTCGATGTTGATTATTCTGTTCGTCAGGAGCAGT 
TTCGGTTCTTCACTGAGGCACTTAGAGTTAACAAGTTGAAGGGTTATGTGGATATGATGG 

45 TTACTGAAGCTGAGGATTACTTCTCTAAATGGGGAGAGAGTGGTGAAGTTGATATTAAGG 
TTGAGCTAGAGAGGCTCATCATCTTGACTGCAAGTAGATGTTTACTGGGTCGAGAAGTTC 
GTGATCAGCTTTTTGATGATGTCTCTGCTTTGTTCCATGACCTTGACAATGGAATGCTTC 
CCATCAGTGTTCTCTTCCCATATCTCCCAATTCCAGCTCACCGCCGTCGTGACCGTGCCC 
GAGAAAAGCTTTCGGAGATTTTCGCAAAAATCATTGGGTCGAGAAAACGCTCTGGAAAAA 

50 CAGAGAACGACATGCTGCAGTGTTTCATCGAATCAAAGTACAAAGATGGTAGACAGACAA 
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CCGAATCTGAAGTCACTGGTTTGCTCATTGCTGCTCTGTtTGCAGGACAACACACGAGCT 
CTATCACTTCCACCTGGACCGGTGCTTATCTGATGCGATACAAAGAGTACTTCTCAGCTG 
CTCTTGATGAGCAGAAGAACCTGATTGCGAAACATGGAGACAAGATCGATCATGATATCT 
TATCCGAGATGGATGTTCTCTACCGCTGCATTAAGGAAGCGTTGAGGCTTCACCCTCCAC 
TCATCATGTTAATGAGAGCCTCGCACAGTGATTTCAGCGTGACAGCTCGGGATGGAAAAA 
CTTACGATATCCCAAAGGGTCACATCGTTGCAACCTCCCCTGCATTTGCCAACCGCTTAC 
CGCACATCTTCAAAGACCCCGACACCTACGACCCAGATy^GATTCTCCCCTGGAAGAGAAG 
AGGACAAAGCCGCAGGGGCATTCTCGTACATTGCATTCGGAGGGGGAAGGCACGGGTGCC 
TTGGAGAGCCGTTTGCTTACCTGCAGATCAAAGCCATATGGAGTCATTTGTTGAGGAACT 
TCGAGCTTGAGCTAGTTTCACCGTTCCCTGAGATTGACTGGAACGCTATGGTGGTTGGAG 
TTAAAGGCAATGTGATGGTGCGTTACAAGAGGCgcCAGCTTTCTTAAAGACAAGTTTAAG 
GTTATTGCAGCTTTGGATTTTTCTCTCTGGTTTCTGCTTTGCTTTTGTCCCTCTCTGGTT 
TTAGTTTTGTTGTTGAATAATTCTTCTGTTTTTATAAACTGTTGTTACTCTTTAATTGAC 
ATTTATTTTTAAGCTTCCTAAGTTTGTGGTTCAAAAAAAAAAAAAGGCGGCGTTACT 

SEQ ID 19 = Arabidopsis obtusifoliol C14a- 
demethylase (clone ID: ATA403931) amino acid translation 

MELDSENKLLKTGLVIVATLVIAKLIFSFFTSDSKKKRLPPTLKAWPPLVGSIilKFLKGP 
IIMLREEYPKIiGSVFTVNLVHKKITFLIGPEVSAHFFKASESDLSQQEVYQFNVPTFGPG 
VVPD VDY S WQEQFRFFTEALRVNKLKG Y VDMMVTEAEDY F S KWGE SGE VD I KVELERL I 
I LTASRCLLGREVRDQLFDDVSALFHDLDNGMLP I S VLFP YLP I PAHRRRDRAREKLSE I 
FAKI I GSRKRSGKTENDMLQCF I E S KYKDGRQTTESEVTGLL I AALFAGQHTS S I TSTWT 
GAYLMRYKEYFSAALDEQKNIilAKHGDKIDHDILSEI^VIiYRCIKEALRLHPPLIM 
SHSDFSVTARDGKTYDIPKGHIVATSPAFANRLPHIFKDPDTYDPERFSPGREEDKAAGA 
FSYIAFGGGRHGCLGEPFAYLiQIKAIWSHIiLRNFELELVSPFPEIDWNAMWGVKGNVMV 
RYKRRQLS* 

SEQ ID 20 = Arabidopsis sterol methyl transferase I 
protein sequence (Accession NO: U71400) See Appendix D 

SEQ ID 21 = Tobacco sterol methyl transferase I protein 
sequence (from Prof. Pierre Benveniste Accession NO: U81312) 
See Appendix E 

SEQ ID 22 = Arabidopsis sterol methyl transferase II 
protein sequence (Accession NO: X89867) (from Prof- Pierre 
Benveniste) See Appendix F 

SEQ ID 23 = Arabidopsis sterol C5-desaturase protein 
sequence (Accession NO: X90454) See Appendix G 

SEQ ID 24 = Rubber truncated HMGRlml (S566 to A) 
nucleotide sequence 

ATGGCACGCGCCTCCCATGACGTGTGGGACCTCGAAGATACGGATCCCAACTACCTCATC 
GATGAAGATCACCGTCTCGTTACTTGCCCTCCCGCTAATATATCTACTAAGACTACCATT 
ATTGCCGCACCTACCAAATTGCCTACCTCGGAACCCTTAATTGCACCCTTAGTCTCGGAG 
GAAGACGAAATGATCGTCAACTCCGTCGTGGATGGGAAGATACCCTCCTATTCTCTGGAG 
TCGAAGCTCGGGGACTGCAAACGAGCGGCTGCGATTCGACGCGAGGCTTTGCAGAGGATG 
ACAAGGAGGTCGCTGGAAGGCTTGCCAGTAGAAGGGTTCGATTACGAGTCGATTTTAGGA 
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CAATGCTGTGAAATGCCAGTGGGATACGTGCAGATTCCGGTGGGGATTGCGGGGCCGTTG 
TTGCTGAACGGGCGGGAGTACTCTGTTCCAATGGCGACCACGGAGGGTTGTTTGGTGGCG 
AGCACTAATAGAGGGTGTAAGGCGATTTACTTGTCAGGTGGGGCCACCAGCGTCTTGTTG 
AAGGATGGCATGACAAGAGCGCCTGTTGTAAGATTCGCGTCGGCGACTAGAGCCGCGGAG 
5 TTGAAGTTCTTCTTGGAGGATCCTGACAATTTTGATACCTTGGCCGTAGTTTTTAACAAG 
TCCAGTAGATTTGCGAGGCTCCAAGGCATTAAATGCTCAATTGCTGGTAAGAATCTTTAT 
ATAAGATTCAGCTGCAGCACTGGCGATGCAATGGGGATGAACATGGTTTCTAAAGGGGTT 
CAAAACGTTCTTGAATTTCTTCAAAGTGATTTTTCTGATATGGATGTCATTGGAATCTCA 
GGAAATTTTTGTTCGGATAAGAAGCCTGCTGCTGTAAATTGGAT.TGAAGGACGTGGCAAA 

10 TCAGTTGTTTGTGAGGCAATTATCAAGGAAGAGGTGGTGAAGAAGGTGTTGAAAACCAAT 
GTGGCCTCCCTAGTGGAGCTTAACATGCTCAAGAATCTTGCTGGTTCTGCTGTTGCTGGT 
GCTTTGGGTGGATTTAATGCCCATGCAGGCAACATCGTATCTGCAATCTTTATTGCCACT 
GGCCAGGATCCAGCACAGAATGTTGAGAGTTCTCATTGCATTACCATGATGGAAGCTGTC 
AATGATGGAAAGGATCTCCATATCTCTGTGACCATGCCCTCCATTGAGGTGGGTACAGTC 

15 GGAGGTGGAACTCAACTTGCATCTCAGTCTGCTTGTCTCAATTTGCTTGGGGTGAAGGGT 
GCAAACAAAGAGTCGCCAGGATCAAACTCAAGGCTCCTTGCTGCCATCGTAGCTGGTTCA 
GTTTTGGCTGGTGAGCTCTCCTTGATGTCTGCCATTGCAGCTGGGCAGCTTGTCAAGAGT 
CACATGAAGTACAACAGAGCCAGCAAAGATATGTCTAAAGCTGCATCTTAG 

20 SEQ ID 25 = Rxibber truncated HMGRlml (S566 to A) amino 

acid translation 

MARASHDVWDLEDTDPNYLIDEDHRLVTCPPANI STKTTI I AAPTKLPTSEPLIAPLVSE 
EDEMIWSVVDGKIPSYSLESKLGDCKRAAAIRREALQRMTRRSLEGLPVEGFDYESILG 
QCCEMPVGYVQIPVGIAGPLLljNGREYSVPmTTEGCLVASTNRGCKAIYIiSGGATSVIjL 
25 lODGMTRAPVWFASATRAAELKFFLEDPDNFDTIiAVVFNKSSRFARLQGIKCSIAGKNLY 
IRFSCSTGDAMGlVttWV'SKGVQNVLEFLQSDFSD^ 

S WCEAI I KEEWKKVLKTNVASLVELNMLKNLAGS AVAGALGGFNAHAGNI VSAI FI AT 

GQDPAQlSrVESSHCITMMEAWDGKDLHISVTMPSIEVGWGGGTQLASQSACIiNLL^ 

ANKESPGSNSRLLAAIVAGSVLAGELSIJyiSAIAAGQLVKSHMKYlS^ 

30 



SEQ ID. 26 = Rubber truncated HMGRlTn2 (S567 to A) 
nucleotide sequence 

35 ATGGCACGCGCCTCCCATGACGTGTGGGACCTCGAAGATACGGATCCCAACTACCTCATC 
GATGAAGATCACCGTCTCGTTACTTGCCCTCCCGCTAATATATCTACTAAGACTACCATT 
ATTGCCGCACCTACCAAATTGCCTACCTCGGAACCCTTAATTGCACCCTTAGTCTCGGAG 
GAAGACGAAATGATCGTCAACTCCGTCGTGGATGGGAAGATACCCTCCTATTCTCTGGAG 
TCGAAGCTCGGGGACTGCAAACGAGCGGCTGCGATTCGACGCGAGGCTTTGCAGAGGATG 

40 ACAAGGAGGTCGCTGGAAGGCTTGCCAGTAGAAGGGTTCGATTACGAGTCGATTTTAGGA 
CAATGCTGTGAAATGCCAGTGGGATACGTGCAGATTCCGGTGGGGATTGCGGGGCCGTTG 
TTGCTGAACGGGCGGGAGTACTCTGTTCCAATGGCGACCACGGAGGGTTGTTTGGTGGCG 
AGCACTAATAGAGGGTGTAAGGCGATTTACTTGTCAGGTGGGGCCACCAGCGTCTTGTTG 
AAGGATGGCATGACAAGAGCGCCTGTTGTAAGATTCGCGTCGGCGACTAGAGCCGCGGAG 

45 TTGAAGTTCTTCTTGGAGGATCCTGACAATTTTGATACCTTGGCCGTAGTTTTTAACAAG 
TCCAGTAGATTTGCGAGGCTCCAAGGCATTAAATGCTCAATTGCTGGTAAGAATCTTTAT 
ATAAGATTCAGCTGCAGCACTGGCGATGCAATGGGGATGAACATGGTTTCTAAAGGGGTT 
CAAAACGTTCTTGAATTTCTTCAAAGTGATTTTTCTGATATGGATGTCATTGGAATCTCA 
GGAAATTTTTGTTCGGATAAGAAGCCTGCTGCTGTAAATTGGATTGAAGGACGTGGCAAA 

50 TCAGTTGTTTGTGAGGCAATTATCAAGGT^GAGGTGGTGAAGAAGGTGTTGAAAACCAAT 
GTGGCCTCCCTAGTGGAGCTTAACATGCTCAAGAATCTTGCTGGTTCTGCTGTTGCTGGT 
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GCTTTGGGTGGATTTAATGCCCATGCAGGCAACATCGTATCTGCAATCTTTATTGCCACT 
GGCCAGGATCCAGCACAGAATGTTGAGAGTTCTCATTGCATTACCATGATGGAAGCTGTC 
AATGATGGAAAGGATCTCCATATCTCTGTGACCATGCCCTCCATTGAGGTGGGTACAGTC 
GGAGGTGGAACTCAACTTGCATCTCAGTCTGCTTGTCTCAATTTGCTTGGGGTGAAGGGT 
5 GCAAACAAAGAGTCGCCAGGATCAAACTCAAGGCTCCTTGCTGCCATCGTAGCTGGTTCA 
GTTTTGGCTGGTGAGCTCTCCTTGATGTCTGCCATTGCAGCTGGGCAGCTTGTCAAGAGT 
CA.CATGAAGTACAACAGATCCGCCAAAGATATGTCTAAAGCTGCATCTTAG 



10 SEQ ID 27 = Rubber truncated HMGRlm2 (S567 to A) amino 

acid translation 

MAI^SHDWDLEDTDPNYLIDEDHRLVTCPPANI STKTTI I AAPTKLPTS 

EDEMIWSVVDGKIPSYSLESKLGDCKRAAAIRREAIiQRMTRRSLEGLPVEGFDYESIL^ 

QCCEMPVGYVQIPVGIAGPLLLNGREYSVPMATTEGCLVASTNRGCKAIYLSGGATSVIjL 

15 kdgmtrapwrfasatraaelkffledpdnfdtlawfnks srf;^u^qgi KCS I AGKNLY 
irfscstgdamgmnt^skgvqiwleflqsdfsdmdvigisgnfcsdkkpaav^ 
s wceai i keewkkvlktwaslvelnmlknia f i at 

gqdpaqnvesshcitmmeavitogkdlhiswmpsievgtvgggtqiasqsacl^ 
ankespgsnsrliiaaivagsvlagelslmsaiaagqlvkshmkynrsa^ 

20 

Example 6. Arahidopsis obtusifoliol C14a-demethYlase 
constructs 

The ArcJ^idopsis obtusifoliol C14a-demethylase gene was 
amplified from two separate Arahidopsis mRNA samples (SIN 2 
25 and Keto-10) through use of primers BXK33 and BXK34, as 
described below - 



BXK33 (SEQ ID 28): 5 ' -GAGATCTGAACCCTAACGAGAG-3 ' 
30 BXK34 (SEQ ID 29) : 5 ' -GGAGCTCTTAAGAAAAGGGACGACGC-3 ' 

The primer BXK33 has a Bgl II cleavage site shown in 
bold. The primer BXK34 has a Sac I cleavage site shown in 
bold. The actual size of the structural gene is 1.445 Kb. 

35 The Arahidopsis mRNA was amplified using a Perkin Elmer 

GeneAmp RT-PCR kit. The reverse transcription reaction used 
25 mM MgCls (4 ^il ; 5 mM final), lOX PGR buffer {2 \xl) , di 
DEPC water (1 j-il) , 2 |al each of 1 mM solution of each of 
four dNTPs (dGTP, dATP, dUTP, dCTP) , RNase inhibitor (1 jal 

40 of 10 units per |il stock) , MMLV reverse transcriptase (1 fil 
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of a 2.5 \J/\xl stock), Oligo d(T)16 Primer ( 1 |al of a 2 . 5 
stock) , and 2 \il of an Arahidopsis polyA RNA sample. The 
reaction mix was incubated at room temperature (about 20°C) 
for 10 minutes, then in a PCR machine for one cycle (15 min. 
5 at 42°C, 5 min. at SS^'C and 5 min. at 4''C) . 

Separate primer-mediated amplification reactions were 
carried out using Taq DNA polymerase and Vent DNA polymerase 
to obtain Arahidopsis obtusifoliol C14a-demethylase cDNA 
from the amplified mRNA sample. 

10 



Taq PCR Reaction 


Vent PCR Reaction 


4 jil 25 mM MgCla 


4 ^il 25 mM MgClj 


8 jxl lOX PCR buffer II 


8 |al lOX Vent PCR buffer 


65.5 fxl di DEPC water 


65.5 iLll di DEPC water 


0.5 \xl AmpliTaq polymerase 


0.5 ixl Vent polymerase 



After 1 minute and 35 seconds at 95''C, 1 ^tl each of 15 
|j.M stocks of the upstream and downstream primers (BXK33 and 
BXK34) were added to the PCR reaction (100 )il total PCR 

15 reaction volume) and the PCR reaction solutions were 

subjected to 35 cycles OS^'C for 15 seconds, then 60^C for 30 
secqnds) . The amplified PCR reaction was then maintained at 
72°C for 7 minutes and then stored at 4°C. An amplification 
positive control reaction was carried out under the same 

20 conditions with DM151 and DM152 primers. 

DM151 

(SEQ ID 3 0): 5 ' -GTCTCTGAATCAGAAATCCTTCTATC-3 ' 
DM152 (SEQ ID 31): 5 ' -CATGTCAAATTTCACTGCTTCATCC-3 ' 

25 

Electrophoresis of the nucleic acid solutions after PCR 
amplification displayed an amplification product 
corresponding approximately to the size of the desired 1.445 
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Kb Structural gene. The fragment was cloned into an M13 
vector. A representative sequencing reaction consisted of: 
10 |Lil of plasmid DNA (200-500 ng) , 2 |al of M13 Forward or 
Reverse primer (15 picomoles) and 8 jil of Big Dye Terminator 
5 Reaction Mix (PE Applied Biosci.)- The clone copy of 

ATA101105 was called CPR17398. The sequence of the selected 
clone {Arahidopsis obtusifoliol C14a-demethylase) is 
identified as SEQ ID NO: 9. 

The predicted polypeptide secjuence for the cloned 

10 Arabidopsis obtusifoliol C14a"-demethylase sequence was 

subjected to a BLAST search in the public database and found 
to align with the sorghum obtusifoliol 14 -alpha demethylase 
polypeptide (ATA101105/U74319/gl658192 ; and gl216657/U74319) 
exhibiting 75-78% sequence identity and 87-90% sequence 

15 homology. The cloned nucleic acid encoding Arabidopsis 

obtusifoliol C14a-demethylase (SEQ ID No: 9) is missing the 
5' end. 

The 5 ' terminal portion of the structural gene was 
obtained by the RACE (Rapid Amplification of cDNA Ends) PGR 
20 using primers BXK3 9 and BXK4 0 per manufacturer's 
instructions (Clontech) . 

BXK3 9 ( SEQ ID 3 2 ) : 5 ' - GAGATCTCCACAGATTTAAAGAACCCTAACG- 3 ' 
25 BXK4 0 (SEQ ID 33): 5'- GGAGCTCGGTTTTTAAGAAAAGGGACGACGC - 3 ' 

The cloned nucleic acid encoding full length 
Arabidopsis obtusifoliol C14a-demethylase is identified as 
SEQ ID No:8. The amplified Arabidopsis obtusifoliol C14a- 
30 demethylase structural gene is useful for making constructs 
that express Arabidopsis obtusifoliol C14a-demethylase in 
transgenic plants . 
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In light of the detailed description of the invention 
and the examples presented above, it can be appreciated that 
the several aspects of the invention are achieved. 

It is to be understood that the present invention has 
5 been described in detail by way of illustration and example 
in order to acquaint others skilled in the art with the 
invention, its principles, and its practical application. 
Particular formulations and processes of the present 
invention are not limited to the descriptions of the 

10 specific embodiments presented, but rather the descriptions 
and examples should be viewed in terms of the claims that 
follow and their equivalents. While some of the examples 
and descriptions above include some conclusions about the 
way the invention may function, the inventor does not intend 

15 to be bound by those conclusions and functions, but puts 
them forth only as possible explanations. 

It is to be further understood that the specific 
embodiments of the present invention as set forth are not 
intended as being exhaustive or limiting of the invention, 

20 and that many alternatives, modifications, and variations 
will be apparent to those of ordinary skill in the art in 
light of the foregoing examples and detailed description. 
Accordingly, this invention is intended to embrace all such 
alternatives, modifications, and variations that fall within 

25 the spirit and scope of the following claims . 
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WHAT IS CLAIMED IS : 

1. A recombinant construct comprising: 

(a) a DNA sequence encoding a polypeptide having 

3 -hydroxy-3-methylglutaryl -Coenzyme A reductase enzyme 

activity, and 
5 (b) a DNA sequence encoding at least one 

polypeptide having steroid pathway enzyme activity 

selected from the group consisting of: 

squalene epoxidase enzyme activity, 
sterol methyl transferase I enzyme activity, 
10 sterol C4-demethylase enzyme activity, 

obtusifoliol C14a-demethylase enzyme activity, 
sterol C5-desaturase enzyme activity, and 
sterol methyl transferase II enzyme activity - 

2. The recombinant construct of claim 1, further 
comprising at least one promoter operably linked to 
said coding regions . 

3. The recombinant construct of claim 1, further 
comprising a first promoter operably linked to said DNA 
sequence encoding a polypeptide having 3-hydroxy-3- 
methylglutaryl- Coenzyme A reductase enzyme activity and 

5 a second promoter operably linked to said DNA sequence 
encoding at least one polypeptide having steroid 
pathway enzyme activity, wherein said first and second 
promoters may or may not be the same. 

4 . The recombinant construct of claim 2 or 3 further 
comprising an operably linked transcription termination 
sequence located 3' to , each coding region. 
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5. A recombinant construct according to claim 3 
wherein the promoters are selected from the group 
consisting of seed-specific promoters, organ specific 
promoters and constitutive promoters. 

6. A recombinant vector comprising operably linked in 
the 5 ' to 3 ' direction, 

a promoter, a DNA sequence encoding a polypeptide 
having 3 -hydroxy- 3 -methylglutaryl -Coenzyme A reductase 
5 enzyme activity, and a transcription termination signal 
sequence ; 

a promoter, a DNA sequence encoding at least one 
polypeptide having steroid pathway enzyme activity 
selected from the group consisting of 
10 squalene epoxidase enzyme acitivity, 

sterol methyl transferase I enzyme activity, 
sterol C4-demethylase einzyme activity, 
obtusifoliol C14a-demethylase enzyme activity, 
sterol C5-desaturase enzyme activity, and 
15 sterol methyl transferase II enzyme activity, 

and a transcription termination signal sequence. 

7 . The recombinant vector of claim 6 wherein said 
vector is a plant expression vector. 

8 . A transformed host cell comprising a recombinant 
construct of claim 1 . 

9. The transformed host cell of claim 8 wherein said 
cell is a plant cell. 
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10- A transformed host cell comprising a recombinant 
vector of claim 6. 

11. The transformed host cell according to claim 10 
wherein said host cell is a plant cell . 

12 . A transformed host cell comprising a plant 
expression vector comprising: 

(a) as operably linked components in the 5' to 3 ' 
direction, a promoter, a DNA sequence encoding a 

5 polypeptide having 3 -hydroxy- 3- me thylglutaryl- Coenzyme 
A reductase enzyme activity, and a transcription 
termination signal sequence; and 

(b) as operably linked components in the 5* to 3 ' 
direction, a promoter, a DNA sequence encoding at least 

10 one polypeptide having steroid pathway enzyme activity 
selected from the. group consisting of 

squalene epoxidase enzyme activity, 
sterol methyl transferase 1 enzyme activity, 
sterol C4-demethylase enzyme activity, 
15 obtusifoliol C14a-demethylase enzyme activity, 

sterol C5-desaturase enzyme activity, and 
sterol methyl transferase II enzyme activity, and 
a transcription teirmination signal sequence. 

13 . The transformed host cell according to claim 12 
wherein said host cell is a plant cell. 

14 , A cell culture comprising transformed host cells 
according to any of claims 8-13. 
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15- A plant comprising at least one transformed host 
cell of any of claims 8-13. 

16. A plant according to claim 15 wherein said 
transformed host cell comprises a plant cell. 

17. A storage organ, comprising at least one 
transformed host cell according to any one of claims 8- 
13 . 

18. A storage organ including at least one transformed 
host cell containing a recombinant vector comprising: 

(a) as operably linked components in the 5* to 3 ' 
direction, a promoter, a DNA sequence encoding at least 

5 one polypeptide having 3-hydroxy-3-methylglutaryl- 
Coenzyme A reductase enzyme activity, and a 
transcription termination signal sequence; and 

(b) as operably linked components in the 5' to 3 ' 
direction, a promoter, a DNA sequence encoding a 

10 polypeptide having steroid pathway enzyme activity 
selected from the group consisting of 

squalene epoxidase enzyme activity, 
sterol methyl transferase I enzyme activity, 
sterol C4-demethylase enzyme activity, 
15 obtusifoliol C14a-demethylase enzyme activity, 

sterol C5-desaturase enzyme activity, and 
sterol methyl transferase II enzyme activity, and 
a transcription termination signal sequence . 

19. The storage organ according to claim 18 wherein 
said recombinant vector is a plant expression vector. 
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20- A process of increasing the formation of steroid 
pathway products in a transformed host cell as compared 
to an otherwise identical non- trans formed host cell 
comprising : 

5 (1) transforming a host cell with a recombinant 

vector comprising 

(a) as operably linked components in 
the 5 ' to 3 ' direction, a promoter, a DNA 
sequence encoding a first polypeptide having 

10 3 -hydroxy-3-methylglutaryl -Coenzyme A 

reductase enzyme activity, and a 
transcription termination signal sequence; 
and 

(b) as operably linked components in 
15 the 5' to 3 ' direction, a promoter, a DNA 

sequence encoding at least one polypeptide 
having steroid pathway enzyme activity 
selected from the group consisting of 

squalene epoxidase enzyme activity, 
20 sterol methyl transferase I enzyme 

activity, 

sterol C4-demethylase enzyme activity, 
obtusifoliol C14a-demethylase enzyme 
activity, 

25 sterol C5-desaturase enzyme activity, 

and 

sterol methyl transferase II enzyme 
activity, 

and a transcription termination signal 
30 sequence, and 

(2) regenerating said transformed plant cell into 
said transgenic plant . 
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21. The process according to claim 20 wherein said 
first encoded polypeptide comprises the catalytic 
region and at least a portion of the linker region but 
is free from the membrane binding region of an 3- 

5 hydroxy- 3 -methylglutaryl -Coenzyme A reductase enzyme. 

22. The process according to claim 20 wherein said 
promoters are promoters whose regulatory function is 
substantially unaffected by levels of squalene in said 
transgenic plant . 

23. The process according to claim 20 wherein said 
plant cell is selected from the group consisting of 
canola, soybean, corn, tobacco, cotton, tomato, potato, 
saf flower, sunflower, peanut, rape, flax, oil palm, 

5 cuphea and alfalfa. 

24. A transgenic plant produced in accordance with the 
process of claim 20. 

25. A transgenic plant seed transformed with a vector 
comprising a DNA segment that encodes a polypeptide 
having 3 -hydroxy-3 -methylglutaryl -Coenzyme A reductase 
activity, and a DNA segment that encodes a polypeptide 

5 having a steroid pathway enzyme activity selected from 
the group consisting of 

squalene epoxidase enzyme activity, 

sterol methyl transferase I enzyme activity, 

sterol C4-demethylase enzyme activity, 

10 obtusifoliol C14a-demethylase enzyme activity, 

sterol C5-desaturase enzyme activity, and 
sterol methyl transferase II enzyme activity. 
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and a promoter suitable for driving expression of said 
polypeptides in said plant cell,- wherein said 

15 transgenic plant seed is capable of germinating into a 
transgenic plant that over-accumulates steroid pathway 
products relative to a non- trans formed plant of the 
same strain; and mutants, recombinants, genetically 
engineered derivatives thereof and hybrids derived 

20 therefrom, wherein said mutants, recombinants, 

genetically engineer derivatives thereof and hybrids 
derived therefrom maintain the ability to 
overac cumulate steroid pathway products. 



26, A plant, the genome of which includes introduced 
DNA comprising: 

DNA encoding a polypeptide having 3-hydroxy-3- 
methyulglutaryl- Coenzyme A reductase enzyme activity, 
5 wherein said plant contains an elevated level of total 
accumulated sterol, compared to an otherwise identical 
plant, the genome of which does not comprise said 
introduced DNA encoding a polypeptide encoding 3- 
hydroxy~3 -methyulglutaryl-Coenzyme A reductase enzyme 
10 activity; and 

further comprising introduced DNA encoding at least one 
polypeptide having steroid pathway enzyme activity 
selected from the group consisting of 

sc[ualene epoxidase enzyme activity, 
15 sterol methyl transferase I enzyme activity, 

sterol C4-demethylase enzyme activity, 
obtusifoliol C14a-demethylase enzyme activity, 
sterol C5-desaturase enzyme activity, and 
sterol methyl transferase II enzyme activity, 
20 wherein said introduced DNAs are operatively linked to 
regulatory signals that cause seed- specif ic expression 



wo 02/061072 



PCT/LIS02/00255 



149 

of said introduced DNAs, and wherein seeds of said 
plant contain a reduced level of squalene, 
cycloartenol , 24 -methylene cycloartenol , obtusif oliol , 
25 stigmasta-7-enol, or campesterol compared to the seeds 
of an otherwise identical plant whose genome does not 
contain introduced DNA encoding said at least one 
polypeptide having steroid pathway enzyme activity. 

27, A plant, the genome of which includes an 
introduced DNA sequence encoding a polypeptide having 
3 -hydroxy- 3 -methyulglutaryl- Coenzyme A reductase enzyme 
activity and an introduced DNA sequence encoding at 
least one polypeptide having squalene epoxidase enzyme 
activity, sterol methyl transferase I enzyme activity, 
sterol C4-demethylase enzyme activity, obtusif oliol 
C14a-demethylase enzyme activity, sterol C5-desaturase 
enzyme activity, or sterol methyl transferase II enzyme 
activity, wherein said introduced DNA is operably 
linked to regulatory signals that cause seed- specific 
expression of said introduced DNA, and wherein said 
plant produces seed having an elevated level of a 
steroid pathway product, compared to a corresponding 
transgenic or non-transgenic plant that does not 
contain said introduced DNA. 

28. A plant comprising introduced DNA encoding 

(i) a polypeptide having 3 -hydroxy- 3 -methyulglutaryl- 
Coenzyme A reductase enzyme activity and 

(ii) at least one additional polypeptide having 

5 squalene epoxidase enzyme activity, sterol methyl 

transferase I enzyme activity, sterol C4-demethylase 
enzyme activity, obtusifoliol C14a-demethylase enzyme 
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activity, sterol C5-desaturase enzyme activity, or 
sterol methyl transferase II enzyme activity, 
10 wherein said plant that produces a storage organ having 
an elevated level of a sterol pathway product compared 
to a corresponding transgenic or non- transgenic plant 
that does not contain said introduced DNA. 

29. The plant of claim 28, wherein said storage organ 
contains a reduced level of squalene, cycloartenol , 24- 
methylene cycloartenol , obtusif oliol , stigmasta- 7 -end , 
campesterol, or mixtures thereof, compared to a 

5 corresponding transgenic plant that comprises 

introduced DNA encoding a polypeptide having 3-hydroxy- 
3 -methyulglutaryl -Coenzyme A reductase enzyme activity 
but that does not contain introduced DNA encoding at 
least one polypeptide having squalene epoxidase enzyme 
10 activity, sterol methyl transferase I enzyme activity, 
sterol C4-demethylase enzyme activity, obtusif oliol 
C14a-demethylase enzyme activity, sterol C5-desaturase 
enzyme activity, sterol methyl transferase II enzyme 
activity. 

30. The plant of any one of claims 24 to 29, wherein 
said regulatory signals cause seed-specific expression 
of said introduced DNAs . 

31. A seed of a plant according to any one of claims 
24 to 30. 

32 . Progeny of a plant according to any one of claims 
24 to 30. 
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33. A plant cell or transformed plant cell of a plant 
according to any one of claims 24 to 30. 

34. A cell culture, comprising cells according to 
claim 33, 

35. A plant produced from a seed according to claim 
31. 

36. A method of producing a plant comprising sexually 
crossing a plant according to any one of claims 15, 16 
or 24-3 0 with a non- transformed plant. 

37. A plant produced by the method of claim 36. 

38. A plant according to any one of claims 24 to 30, 
wherein said plant is an apomicitic plant. 

39. A seed resulting from a cross of the plant of 
claim 3 8 with a nurse cultivar. 

40 . A uniform population of plants according to any 
one of claims 15, 16, 24-30, 37 or 38. 

41. Seed of a plant produced by the method of claim 
36. 

42. A part, other than a seed, of a plant according to 
any one of claims 15, 16, 24-30, 37 or 38. 

43. Oil containing at least one sterol pathway 
product, extracted from seed of a plant according to 
any one of claims 15, 16, 24-30, 37 or 38. 
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44. Oil according to claim 43, wherein sitosterol, at 
least one sitosterol ester, or mixtures thereof, 
comprise at least about 57% by weight of the total 
sterol compounds of said oil . 

45. Oil according to claim 43, wherein sitosterol, at 
least one sitosterol ester, or mixtures thereof, 
comprise at least about 0.08% of the dry weight of said 
seed. 

46. Oil according to claim 43, having a reduced amount 
of squalene, cycloartenol , 24-rmethylene cycloartenol , 
obtusifoliol, stigmasta-7-enol, campesterol, or 
mixtures thereof, compared to oil obtained from a 

5 corresponding plant that does not contain introduced 
DNA encoding at least one polypeptide having squalene 
epoxidase enzyme activity, sterol methyl transferase I 
enzyme activity, a sterol C4 -demethylase enzyme 
activity, obtusifoliol C14a-demethylase enzyme 
10 activity, sterol C5-desaturase enzyme activity, or 

sterol methyl transferase II enzyme activity; wherein 
the reduction is in the range of from about 10% to 
about 100% . 

47- A sitosterol ester composition extracted from seed 
of a plant according to any one of claims 15, 16, 24- 
30, 37 or 38- 

48. The sitosterol ester composition of claim 47, 
wherein an esterifying fatty acid has 2 to 22 carbon 
atoms in the main chain. 
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49. A cholesterol -lowering composition, comprising 
said oil of any one of claims 43 to 46. 

50. A food/ food ingredient, or food composition, 
comprising said oil of any one of claims 43 to 46, 

51. A pharmaceutical composition, comprising a 
cholesterol -lowering effective amount of said oil of 
any one of claims 4 3 to 46, and a pharmaceutical ly 
acceptable carrier, excipient, or diluent. 

52 . A method of lowering the plasma concentration of 
low density lipoprotein cholesterol, comprising orally 
administering to a human or animal subject an effective 
amount of a composition according to any one of claims 

5 49-51. 

53 . A method of treating or preventing an elevated 
plasma concentration of low density lipoprotein 
cholesterol, comprising orally administering to a human 
or animal subject an effective amount of a composition 

5 according to any one of claims 49-51. 

54. A method of making a food additive composition, 
comprising : 

obtaining oil containing a sterol pathway product 
compound from seed of a transgenic plant according to 
5 any one of claims 15, 16, 24-30, 37 or 38; and 

mixing said oil with an edible solubilizing agent, a 
dispersant, and optionally, an antioxidant. 

55. An isolated DNA molecule, having a nucleotide 
sequence selected from the group consisting of: 
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(a) disclosure squalene epoxidase SEQ ID NO:4, 6, 8, 
10, or the complement thereof; 
5 (b) a nucleotide sequence that hybridizes to said 
nucleotide sequence of (a) under a wash stringency- 
equivalent to 0.5X SSC to 2X SSC, 0.1% SDS, at 55-65°C, 
and which encodes a polypeptide having squalene 
epoxidase enzymatic activity substantially similar to 
10 that of the disclosed squalene epoxidase; 

(c) a nucleotide sequence encoding the same genetic 
information as said nucleotide sequence of (a) , but 
which is degenerate in accordance with the degeneracy 
of the genetic code; and 
15 (d) a nucleotide seqiience encoding the same genetic 
information as said nucleotide sequence of (b) , but 
which is degenerate in accordance with the degeneracy 
of the genetic code. 

56- An isolated DNA molecule, having a nucleotide 
sequence selected from the group consisting of: 

(a) disclosure obtusifoliol C14a-demethylase SEQ ID 
NO: 12, 14, or 17, or the complement thereof; 

(b) a nucleotide sequence that hybridizes to said 
nucleotide sequence of (a) under a wash stringency 
equivalent to 0 . 5X SSC to 2X SSC, 0,1% SDS, at 55-65*^C, 
and which encodes a polypeptide having obtusifoliol 
C14a-demethylase enzymatic activity substantially 
similar to that of the disclosed obtusifoliol C14a- 
demethylase ; 

(c) a nucleotide sequence encoding the same genetic 
information as said nucleotide sequence of (a) , but 
which is degenerate in accordance with the degeneracy 
of the genetic code; and 
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(d) a nucleotide sequence encoding the same genetic 
information as said nucleotide sequence of (b) , but 
which is degenerate in accordance with the degeneracy 
20 of the genetic code. 

57. A recombinant construct, comprising as operably 
linked components in the 5' to 3 ' direction, a seed- 
specific promoter, an isolated DNA molecule of claim 55 
or 56, and a transcription termination signal sequence. 

58. A recombinant vector comprising said recombinant 
construct of claim 57. 

59. The recombinant vector of claim 58, which is a 
plant expression vector. 

60. A transformed host cell, comprising said 
recombinant construct of claim 57. 

61. The transformed host cell of claim 60, wherein 
said host cell is a plant cell . 

62 . A method of producing squalene epoxidase or 
obtusifoliol C14a-demethylase, comprising culturing a 
transformed host cell of claim 60 for a time and under 
conditions conductive to the production of said 
5 squalene epoxidase or obtusifoliol C14a-demethylase 
enzyme, and recovering said squalene epoxidase or 
obtusifoliol C14a-demethylase produced thereby. 
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63. The transformed host cell of claim 8, further 
comprising a recombinant construct encoding a 
tocopherol synthesis pathway enzyme- 

64. The transformed host cell of claim 10, . further 
comprising a recombinant vector encoding a tocopherol 
synthesis pathway enzyme. 

65. The transformed host cell of claim 64, wherein 
said tocopherol synthesis pathway enzyme is S- 
adenosylme t hi onine - dependent y- tocopherol 
methyltransf erase . 

66. The transformed host cell of claim 64, wherein 
said vectors are plant expression vectors . 

67. The storage organ of claim 18, wherein said at 
least one transformed host cell further contains a 
recombinant vector comprising as operably linked 
components, a promoter, a DNA sequence enooding a 

5 tocopherol synthesis pathway enzyme and a transcription 
termination sequence. 

68. The storage organ of claim 18, wherein said 
tocopherol synthesis enzyme is S-adenosylmethionine- 
dependent y- tocopherol methyltransf erase . 

69. A process of increasing the formation of steroid 
pathway products and tocopherols in a transformed host 
cell as compared to an otherwise identical non- 
transformed host cell comprising, 
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(A) transforming a host cell with: 

(1) a recombinant vector comprising 

(a) as operably linked components in the 5' to 3' 
direction, a promoter, a DNA sequence encoding a first 
polypeptide having 3 -hydroxy- 3 -methylglutaryl -Coenzyme 
A reductase enzyme activity, and a transcription 
termination signal sequence; and 

(b) as operably linked components in the 5' to 3 ' 
direction, a promoter, a DNA sequence encoding at least 
one polypeptide having steroid pathway enzyme activity 

15 selected from the group consisting of 
squalene epoxidase enzyme activity, 
sterol methyl transferase I enzyme activity, 
sterol C4-demethylase enzyme activity, 
obtusifoliol C14a-demethylase enzyme activity, 

20 sterol C5-desaturase enzyme activity, and 

sterol methyl transferase II enzyme activity, 

and a transcription termination signal sequence; 

(2) a recombinant vector comprising as operably linked 

components, a promoter, a DNA sequence encoding at 

25 least one tocopherol synthesis pathway enzyme selected 
from the group consisting of 

3- deoxy-D-arabino-heptulosonate-7-P synthase (DAHP 
synthase) , 

shikimate kinase, 
30 a prephenate dehydrogenase, 

1-deoxy-d-xylulose 5-phosphate synthetase (DXS) , 
1-deoxy-d-xylulose 5-phosphate reductoisomerase (DXR) , 

4- diphosphocytidyl-2C-methyl-d-erythritol synthase 

(YgbP) , 

35 4 -diphosphocytidyl -2C-methyl -d-erythritol kinase 
(YchB) , 
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2C-methyl-d-erythritol 2 , 4-cyclodiphosphate synthase 
(YgbB) , 

the gene product of GcpE, LytB, 
40 geranylgeranylpyrophosphate synthase (GGPPS) , 

geranylgeranylpyrophosphate hydrogenase (GGH) , 

phytyl /prenyl transferase (PPT) , 

4 -hydroxy -phenyl pyruvate dioxygenase (HPPD) , 

2 -methyl - 6-phytylplastooquinol , 
45 tocopherol methyltransf erase I (MTI) , 

tocopherol cyclase and 

y-tocopherol methyltransf erase (GMT) ; 

4-amino-4-dexoyprephenate dehydrogenase (TryA) ; 

sir 1736; 
50 sir 1737 

ATPT2 ; 

AANTl ; and 

and a transcription termination sequence; and 
55 (B) regenerating said transformed plant cell into said 
transgenic plant. 

70. The process of claim 69, wherein said tocopherol 
synthesis enzyme is S-adenosylmethionine- dependent y- 
tocopherol methyltransf erase . 

71. A plant comprising at least one host cell of any 
of claims 63-66. 

72. A plant produced by the process of claim 69 or 70. 

73. An oil from the plant of claim 71 or 72. 

74. A seed from the plant of claim 71 or 72. 
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75. An oil from the seed of claim 74. 

76. A plant produced by crossing the plant of . any of 
claims 26-30, 35, 37, or 38 with a plant comprising a 
recombinant expression vector containing as operably 
linked components a promoter, a DNA sequence encoding 

5 tocopherol synthesis pathway enzyme, and a 
transcription termination sequence. 

77. The plant of claim 77, wherein said tocopherol 
synthesis enzyme is S-adenosylmethionine- dependent - 
tocopherol methyltransf erase . 
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Figure 6: Construct pMON438 IS 
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Figure 8: Construct pMON5 1850 
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Figure 1 1 : Sterol composition of Rl transgenic soybean seeds when Arabidopsis truncated HMCR 

^"L^oi^J^n --'^ "^^'^^v. lio*"''^ overexpressed using seed-specific 7S promoter ( daca from 
pMON-l30d7: p7s::W/ HiVlGR truncated). -it i rum 
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Figure 12: Sterol composition of Rl transgenic soybean seeds when ArabiJopsis truncated HMGR 
(catalytic domain without linker) and Aradidopsis SMTII were overexpressed (data from 
pMON43058; p7S::At HiMCR truncated & p7S::^/ SMTII). The expression of the genes is 
controlled by the seed-specific 7S promoter. 
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Figure 20: Construct pMON53740 
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VRENIARGMA 

HMGRclustalW{ rat} FLDKELTGLN EALPFFUliIil DLSRASALAK FALSSNSQDE 

VRSNIARGMA 

HMGRclustalW{ rabbit} FIiDKSLTGIiN EALPFFLIiLI DLSRASAIxAK FALSSNSQDE 

VRSNIARGMA 

KMGRclustalW{ human} FLDKELTGLN EALPFFLXjIiI DLSRASTLAK FALSSNSCD^* 

VRENIARGMA 

HMGRclustalw{ mouse} 



HMGRclustalW{ xsnopus } FLDKELTGI*W EALPFFLLLI DLSKASAIiAK FALSSNSQDE 
VRDNIARGMA 

HMGRclustalW{sea urchin} LFGLELTGLN EALPFFLLLI DLTKASALTK FALSSTTQNE 
WDNIARGMA 

HMGRclustalW{ cockroach} FLGSDVSDLK DALFFFLLLI DLSKATVLAQ FALSSRSQDS 
VXHNIARGXA 

HMGRclustalW{drosophila} FLGSDISSLK DALFFLLLVI DLSNSGRLRS GAMGSN.QAE 
VTQNIARGLE 

HMGRclustalW{dictyostel} 



HMGRclustalW{schistosom} MLRNKRQLNT LFYTLILFTF ALCSLSSVLF V^YTS-AX-L 
LSTSVFLLFS 

HMGRclus talW { archaeoglo } 



HMGRclus t al W {p s eudomonas } 

Consensus FLDKELTGLN EALPFFLLL- DL-RASALAK FALSSNSQDE 

VRENXARGMA 
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350 

HMGRc lus t a 1 W { me t hanobac } 



HMGRclus calW { me chanococ } 
HMGRclus tal W { halobacter } 
HMGRclustalW{ sulf olobus } 



HMGRclus talW{ yeasC2} QEGAYLIRDY LFYISSFIGC AIYARHLPGL VNFCXIiSTFM 

LVFDLLiLSAT 

HMGRclus talW{ yeastl} EEGGRLIQDH LLCIFAFIGC SMYAHQLKTIi TNFCXIiSAFZ 

LIFHLILTPT 

HMGRclustalw{ph.ycomyces} , 



HMGRclus calW{ fusarium} VSS^/MSICVW PLRMMA5RRV A£>r/AKGDCE LNRVHGDAPli 
FGRXSSSIPK 

HMGRclust:alW{ Candida} EQGPLLLRDE LFMITAFLGC SFYASYLDGL KNFCIU\AI,I 
lAFDIiiTTST 

HMGRclustalw{dict:ycst£2 } 



HMGRclusCalW{wheat:l} 



HMGRclus talW{ rice} MRIT. 

. . . NGLAM^/S 

HMGRclus t:alW{ com} PE.... PSRA AA RVQAGDALPL PIRHT. 

. - .NLIFSAIi 

HMGRclus talw{wheat3} 



HMGRclust:alw{wheat:2} 
HMGRclus talw{ soybean} 



KMGRclust:alW{rubbertre3} HK !• PP LXPSDYSLPI. SLYLA. . 

NALVFSL 

HMGRclust:alW{rosyperiwi} NQ Q PS Z'9SiSSiyyn,^h PLYLA. . 

. . ■ NGVFFTL 

HMGRclustalW{ tomato} QQ. . . .QEDK N TL LIDASDALPIi PIiYIiTT. 

2TGLFFTM 

HMGRclustalW{woodtofaacc} QQ. QEQD N SL LI.ASDALPIi PLYLT . . 

. . .NGLFFTM 

HMGRclus talW{ potato} VS SPKASDALPL- PLYLT. . 

. - .NGLFFTM 

HMGRclus talW{ radish} DD....DDRR K TLTS PPKASDALPL PLYLT.. 

. . .NA'/FFTL 

HMGRclustalW{arabadopsisl} SD....DDHR RR. .ATTIAP PPKASDALPL PLYLT.. 
. . .NAVFFTL 

HMGRclustalW{cucumismel} SA. . . .ADHL KR A SPKASDALPL PLYLT. . 

. • .NTIFFTL 

HMGRclustalW{rubbertrs2} 



HMGRclus taiw{rubbertrel} PV EDRS P T TPKASDALPL PLYLT 

. . .NA^/FFTL 

HMGRclustalW{camptothec} KV DAVD L PD SPKASDALPL PLYIT 

. . .NGVFFTL 

HMGRclus talW{arabadops2} VAVD PP LRXASDALPL PLYLT 

. . .NTFFLSL 

HMGRclus talW{chiiieseham} ILGPTFTLDA LV..ECLVIG VGTMSGVRQL EIMCCFGCMS 
VLANYFVFMT 
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HMGRclusCalW{chaneseha2) ILGPTFTIiDA LV. .ECLVTG VGTMSGVHQL EIMCCFGCMS 
VLANYFVFMT 

HMGRclustalW{syrianhanist} ILGPTFTTJDA LV. .ECLVXG VGTMSGVRQL EIMCC^CMS 
VU^NYFVFMT 

HMGRclusCalW{ rat} ILGPTFTIiDA LV. .ECLVIG VGTMSGVRQL EIMCCFGCMS 

VLANYFVFMT 

EMGRclust:alW{ rabbit} ILGPTFTLDA LV. .ECLVIG VGTMSGVRQL EIMCCFGCMS 
VLANYFVFMT 

EMGRclustalW{ human} ILGPTFTLDA LV. .ECLVXG VGTMSGVRQL EIMCCFGCMS*. 

VLANYFVFMT 

HMGRclustalW{ mouse} 

HMGRclustalW{ xenopus} ILGPTFTLEA LV. .ECLVIG VGTMSGVRQL EIMCCFGCMS 
VLANYFAFMT 

HMGRclustalW{sea urchin} ILGPTITLDT W. . TTLVIS IGTMSSIRKT-! EVFCCFGtlS 
LIANYFVFMT 

HMGRclust:alW{ cockroach} MLGPTITLDT W. . ETLVIG VGMLSGVRRL EVLCCF^VCMS 
VIVNYWFMT 

HMGRclustalW{drosophila} LLGP.AISLDT IV. .WLLVG VGTLSGVQRL EVLCMFAVLS 
VLVNYWFMT 

HMGRclustalW{diccyostel} m LF.JVPPNLETK ELFWIIY.IL 

ILIPKVFAKV 

HMGRclustalw{schistosom} DLSVFFIVLE YYLLEISLVN YHEAXRHCLL SHLFSNQLFV 
DEMLGMFLKT 

HMGRclustalw{archaecslo} 

HMGRclustalW{pseudomonas } 

Consensus ILGPTFTLDA LV- -ECLVIG VGTASD-LPL -LYCTFGCMS 

VLANYFFFMT 
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400 



HMGRclust:alW{me&hanobac} 
HMGRc lus t al W { me t hanococ } 
HMGRclustalw{halobacter} 
fiNGRclustalw{sulfolobus} 



yeasC2} FYSAILSMKL EINIIHRSTV IRQTIi- -EED GWPTTADII 



HMGRclustalW{ 
YKDETASEPH 

HMGRclustalW{ yeastl} FYSAILALRL EMNVTHRSTI IKQTLi. ,EED GWPSTARII 

SKAEKKSVSS 

HMGRclustalW{phycoinyces} : 



HMGRclustalW{ fusarium} FSCVLMXIiGFI FVNIVNICSI PrRNP..SSM STIRTWASSL 
GGVIAPLSVD 

HMGRclustalW{ Candida} FLSAILSLKL EINQIHRSTL LREQL-.EDD GLTETTVDDV 
LKSNSLAGTK 

HMGRcluscalW{dict:yoste2 } 



HMGRclus talW { wheat 1 } 

rice} LVLSSCDLVR LCSDRER. . . PL . . - 



KMGRclustalW{ rice] LVLSSCDLVR LCSDRER. . . PL GGRErA 

TWCQLASW 

EMGRclustalW{ com} FAASLAYLMR RWREKIRSST PLHA VGLASiML 

AIFGLVASLI 

KMGRclustalW{wheat3} 



HMGRc lus talW { wheaC2 } 
EMGRclustalW{ soybean} 



.VTFPEIA 



HMGRclus talW{rubbertre3} FFSVAYFLLH RWRSKIRXST PLHI 

ALICLVASVI 

HMGRc lus CalW(rcsyperiwi} FFSVMVFLLT RWREKIRNAT PLHV VTLSELA 

AIiASLIASVI 

KMGRclusCalW{ tomato} FFSVMYFLLS RWRSKIRNST PLHV VTLSELG 

AXVSLIASVI 

HMGRc 1 US talW{ weed tobacc} FFSVMYYLLS RWREKIRNST PLHV VTFSELV 

AIASLIASVI 

HMGRclus talW{ potato} FFS^/M'i'FLLV RWREKIRNSI PLHV VTLSSLL 

AX1VSLIASVI 

HMGRclustalW{radish} FFSVAYYLLH RWRBKIRYNT PLKV VTVTELG 

AIVALIASFI 

HMGRclustalW{arabadopsisl} FFSVAYYLLH RWRDKIRYNT PLKV VTITELG 

AXIALIASFI 

HMGRclus talW{cucumisinel} FFSVAYYLLH RWRDKIRNST PLHV VTLSEIA 

AIVSLMASFI 

HMGRclus tal w { rubbertre2 } 



Hr^GRclust:alW{rubbertrel} FFSVAYYLLH RWRDKIRNST PLHI VTLSEIV 

AXVSLIASFI 

HMGRclustalW{ camp to thee} FFTWYYLLV RWREKIRNST PLHV VTLSEIA 

AXFTFVASFI 

HMGRclus tal W{arabadops2} FFATVYFLLS RWREKIRNST PLHV . . .VDLSHIC 

ALIGFVASFI 

HMGRclus talW{chineseham} FFPACVSLVL ELSRESRSGR PIWQ...LSE FAR^/LESHE. 
NKPNPVTQRV 
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nSS^SI^''"^'^"°^"™="J FFPACVSLVL EI.SRESREGR PIWQ. . .I.SH FAH^.1.EEEE. 

NKPN^Si"""'''^ "'^^ FFPACVSLVI. ELSRESREGR PI„q . . . r.sH FA^VX^EHEE . 

KK^S^Sf ''^ "^^'"^ FFPACVSI^VI. ELSRESSEGR PIWQ. . .X,SH F.^RVXEEEE . 

nSSSSSJ"''"^ """"^^ FFPACVSLVI. ELSRESREGR PIWQ. . .I.SK FARVLEEEE. 
HMGRclustalW{ mouse} 

HK^S^Rr"^"^ FFPACVSLVI. ELSRESREGR PIWQ...I,SQ F.:^V-.EEEED 

RkS;?w5rv'"'''^'"" ""^"^'"^ FFPACLSLVI, ELSNSNKyGR PVWH...r,GR FAHVI.EEEED 

QK^WQRv"'"^ cockroach} FVPACLSLIL . ELSRSGESGR PAWED.. KSL IIXALKEED. 

QK^JS^QRf '""^^""'^^^''"^ FYPACLSLIF DLSRSGVDMS WREKAKGSI. PLKSLTEEE. 

VT^SvqS"'"^^'"'"°''"'^ MSVREI.-prF KWGFIXIRRSN pr.VP xx^SN^H^ 

RI^;S^"*"^^''''"^°^°"'^ SLFSISTTSK YAVLESIFKC TLMEQIIYIM rVFVFLPSFM 
HMGRc lus t al W { ar chaeoglo } 

EMGRclustalW{pseudomonas} 

VASRI consensus FFSACYSLLI. -WRRKIRNST PLKV- --LSH FARVTLESZA AX?N- 
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HMGRclustalw{me thanobac} 



HMGRc 1 us t a 1 W { me t hanococ } 



HMGRclusCalW{haIobaccsr} 



\ HMGRclus talW { sul f olobus } 



HMGRclustalW{ yeast2 } 

NTVYFDSTIY 

KMGRclustalW{ yeast 1} 

NSLYFDKHRV 

HMGRclustalW{phycomyces} 



HMGRclus talW { f usar ium} 
PSIHYALGSA 

HMGRc lust:alw{ Candida} 
SAGNETNDTF 

HMGRclust:alW{dictyoste2 } 



HMGRclus talW { wheat: I } 



HMGRclus talW{ rice} 



HMGRclustalW{ corn} 



HMGRclustalw{wheat3 } 



HMGRclustalW{wheat:2} 



HMGRc lust alw{ soybean} 



HMGRclustalW{2rubbertre3 } YLLGFrGIGF VHSFS.RAST 



HMGRclustalW{rosyperiwi} VLVSFFGLDF VQSLIYKPNN 



HMGRclustalW{ tomato} YliLGFFGIGF VQTFVSRG2JN 



HMGRclustalW{woodtobacc} YLLGFFGIGF VQSFVSRDNN 



HMGRclus talW{ potato} YLLGFFGIGF VQSFVSRSNS 



HMGRclustalW{radish} YLLGFFGIDF VQSFISRP 



HMGRclus talW{arabadops is 1} YLLGFFGIDF VQSFISRASG 



HMGRclus talW{cucumismel} YLLGFFGIDF .VQSFIARSSP 



HMGRclustalW{rubbertre2 } 



HMGRclustalW{rubbertrel} YliLGFFGIDF VQSFIARASH 



HMGRclustalw{camptothec} YLLGFFGIGL VQPFTSRSSH 



HMGRclus talW{arabadops2} YLDGFCGIDL IFRSS..SD 



HMGRclustalW{chineseham} KMIMSLGLVX, YHAHSRWIAD PSPQNST. . • TE.HSKVSLG 
LDEDVSKRIE 



FIiRSNVAIIL GKASVIGLLL LINLYVF. . . . .TDKLNATIIi 
FLJTLS^/WII MKLSVXIiIiFV FINFYNF, - . GANWVN.DAF 



PFKVASNGLD AILPTAKSNN RPTLVTV. . . LTPIKYELEY 
TFTDAPSTLV WAKVAGVSV FFGLHFY. . . GFGSAWLSDii 



YLLSLFAHPD APATTTGDDD 
YLLSFFGXAF VQSIVSSGDD 
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EMGRclustalw{chiaeseha2 } 
LDEDVSKRIS 

HMGRclus t alW { syrianhams t } 
LDEDVSKRXE 

HMGRclus talW{ 
ZiAHOVSKRXE 

HMGRclus t:alW{ 
LDENVSKRIE 

HMGRclus t:alW{ 
IiDENVSKRIE 
HMGRclus talW{ 



rat} 
rabbit} 
human} 
mouse} 
xenopus} 



KMIMSLGLVL VHAHSRWIAD PSPQNST. . 

KMIMSLGLVL VKAESRWXAD PSPQNST. . 
KMIMSLGLVL VHAHSRWIAD PSPQNST . . 
KMIMSLGLVL VHAHSRWIAD PSPQNST. . 
KMIMSLGLVL VHAHSRWIAD PSPQNST. , 



- TE.HSKVSLG 
. TE.HSK^/SLG 
. AH.QSKVSLG 
. AD.NSKVSLG 
. AD.TSK^/SLG 



HMGRclustalW{ 
LDDMMPKRVS 

HMGRclustalw(sea urchin} 
GNLLTDKXID 

HMGRc lus talW { cockroach } 



EMGRclustalW(drosophila} 
LSLN^/SNNRT 

KMGRclustalW {dictyos tel } 



HMGRclustalW { schistosom} 
NLIGY^/VNPN 

HMGRclustalW { archaeoglo } 



KMIMSLGLVL VKAESRWISE PSSQNST. . . SISDHHVTTM 
KMIMRTGLVL VHAHSYWLAS ....NDT,.. ELMSHDMLYD 

KVIMSAGLML VHAH.RWVRC 

KLIMTTGLMA VHIYSRSVS? - . . .AAT. . . TMVDKTLTPT 

KPLPYIPQKN QQQQQKQQPS 

YGEQKKCLVS NKGVSSSTRK RRHSYSSGKS YVEYRRMSVH 



HMGRclustalW{pseudomonas } 



VSKRIE 



Consensus YLL-FFG-VL V-A-SR-ISD PSPQNST SKVSLG LDE- 
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HMGRclus talw{me thanobac } 
HMGRclus tal W { me thanococ } 
HMGRclus talW { halobactier } 
HMGRclus talW { sulf olobus } 

HMGRclus t:alw{ yeast2} 
VLIjI IDS VSN 

HMGRclustalW{ yeastl} 
VLLLLRNVSV 

HMGRclus calW{phycoinyces} 

HMGRclus talW{ fusarium) 
IVIALALSVA 

HMGRc luscalW{ c andi da } 
VLSFIGLIST 

HMGRclus tal W { diccyos t e2 } 

HMGRclus tal W { wheatl } 
HMGRclustalW{ rice) 
HMGRclus talw{ corn) 
HMGRclust:alW{ wheats } 
HMGRclustalW{wheat2} 
HMGRclus talW { soybean} 

HMGRcluscalw{rubbercrs3} 

R 

HMGRclus calW { rosyperiwi } 

RN 

HMGRclus tal W{ tomato} 

RC 

HMGRclus talw { woodtobacc } 

RR 

HMGRcluscalW{ potato} 

RR 

HMGRclus talW { radish} 

R 

HMGRclus tal W { arabadops i s 1 } 

R 

HMGRclustalW{ cucumismel } 

L 

HMGRclus talW{ rubbertre2 } 

HMGRclus talw{nibbertrel} 

R 

HMGRclustalW{camptothec} 

R 

HMGRclustalW{arabadops2} 

HMGRclus talW{chirxeseham} 
ET. .ESTLSL 



451 

SliPNrlNYXD IGNIiSNQVII SVIiPKQWTP LKXYKQISDS 
SLPDFITSNA SENFKSQAIV SVTPLLyYKP IKSYQRXHDM 

ASNPAVN.DA FHEHFQGYGV GGRMVGGILX SLEDPVI,SiC^ 
TLYDAVA.DQ IPIGSNGTLV TLFPTRFFLP EKLSTQIZAV 

. .D 

. .DEDFLVGS G 

-,D.SWDVEE y D DDNIIIK2DT 

-.E-GWEXEE EILMVEDS 

. .D.SWDE N DEEFLIiKEDS 

. .DECWDEED S N DEQFLLEEDS 

..D.SWDIED E N ASQLIIEEDS 

. .D.SGDSER DFDDH 

..D.A5TOLAD T ; I DDDDH 

. -D.AWDLED EIDRT 

-.D-VWDLED T D P.NYLIDEDH 

. .DDVWGVDD DE D VDEIVLKEDT 

. .DDVWVNDG 

PSVSLWQFYL SKMISMDIEQ WTIiSLAFLL AVKYIFFEQA 



nones ■*nrr'3 At % 
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HMGRclust:alW{chizieselia2 } 
ET. .ESTLSL 

HMGRclus t al W { syr iazihams t } 
ET. .ESTLSL 

KMGRclustaiW{ 
ET. .ESTLSL 

HMGRclus talW{ 
ET. .ESTLSL 

HMGRclustalWf 
ET- .ESTLSL 
EMGRclustalW{ 



rat} 
rabbit} 
human} 
mouse} 
xenopus } 



EMGRclustalW{ 
ET. .ESTFSM 

HMGRclustalW{sea urchin} 
ATYPEKRVSI 

HMGRclus talw{ cockroach} 
EE. .LPTVKL 

EMGRclus talw{ drosoDhi la } 
DP . . . LPDQL 

HMGRclus talw{ die tyost el} 
IN 

EMGRclus talW{schistoscm} 
VFPVLYHITA 

HMGRclus tal W{ archaeoglo } 



PSVSLWQFYL SKMISMDIEQ WTLSLAFLL AVKYIFFEQA 
PSVSLWQFYL SKMISMDISQ WTLSLAFLL AVKYIFFEQA ; 
PSVSLWQFYL SKMISMDIEQ VITLSLALLL AVKYIFFEQA 
PSVSLWQFYL SKMISMDISQ VITLSLALLL AVKYIFFEQA 
PSVSLWQFYL SKMISMDISQ VITLSLALLL AVKYIFFSQT 

PSMPLWQFYL SRMVTMDVSQ IITLGLALLL AVKYIFFSQT 
PTMPLWEFYA TRLWPPTLDY ILTAJLAT^/L ASKYIFFSDL 

.SIALWPDLT S LRY FCTHCDTGVS YSRWSFASEG 

ESGEIADIII KWLT.MSADH IVISIVLIAL WKFICFDNR 

. .QDYIQQPQ Djjjj 

CKYKCWSTTF VIFVSLIILK LNNRYSERIS SFKHNSSENE 



HMGRclus CalW{pseudomonas} 



ESTLSL 



Consensus PSDSLWDFY- SKMISMDIEQ WTLSLA-LL AVKYIFFED- RT- 
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HMGRclus tal W { me chanobac } 



501 



HMGRcluscalW {methanococ } 
HMGRclus talW{halobacter} 
HMGRclusCalW{ sulf Globus } 



HMGRclust:alW{ yeast:2} AIRDQFISKL LFFAFAYSIS INVYLLNAAX IHTGYMNFQ. 

. . PQSNKXDD 

HMGRclus t:alW{ yeastl} AIRDRFVSKL VLSALVCSAV INVYIiLNAAR IKTSYTADQL 

VKTEVTKKSF 

HMGRclus tal W{phycomyces } 



HMGRclustalW{ fusarium} LNGYLFNVAR WGIKDPNVPE KNIDRNELAR ARSFNDTGS . 
AT 

HMGRclus t:alW{ Candida} AARDKYISKF ILFAFAVSAS INVYLLNVAR lETTRXiSDA. 
IE 

HMGRclustalW{dict:yoste2} : 



HMGRclus talW { wfceatl } 

HMGRclustalw{ rice} ...... 

HMGRclustalW{ corn} : 

HMGRclustalW{wheat3} 

HMGRclustalW{wheat:2} 

HMGRclustalW{ soybean} 

HMGRclus taiw { rubbert:re3 } 

EMGRcl us tal W { rosyperi wi } G . 

HMGRclus taiw{ tomato} G. 

HMGRclus talw{woodtobacc} G. 

HMGRclus taiw{ potato} G. 

HMGRclus talw{ radish} 

HMGRclustalW{arabadopsisl} 

HMGRclus talW{cucumismel} 

HMGRclus talW { rubber tre2 } 

HMGRclus tal W { zrubbertrel } 

HMGRclus talW { camp to thee } 

HMGRclus talW{arabadops2 } 



HMGRclus talW{chineseham} KN..PITSPV VTPKKAPDNC CRREPLLVRR SEKLSSVEEE 
PGVSQDRKVE 
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EMGRC lus t al W { chine s eha2 
PGVSQDRXVE 
HMGRclustalw{syrianhamst 

PGVNQDRKVE 

HMGRclust:alw{ rat 
PGVNQDRKVE 

HMGRc 1 US c al W { r ahfa i C 
AGMSQDRKVE 

EMGRcluscalw{ human 
TGINRERKVE 
HMGRclustalW{ mcuse 



xencous 



HMGRcluscalW{ 
S . . SKSETEA 

HMGRclustalW{sea urchin 
TSRTQACQTD 

HMGRC lus t al W { coc3cr oach 
LALAVKFVFF 

HMGRclustalW{drosophila 

TENSAAVRTIi 

HMGRclustalW{dict;yostel 



HMGRclus t al W { s chi s tosom 
INSQIiRNLNI 

HMGRc lus cal W { archaecglo 



HMGRclustalW{pseudcmcnas 



SQDRKVE 



Consensus 



KN-.PITSPV VTPKKAPDNC CRREPLLVRR SEKLSSVEEE 
KN..PITSPV ATPKKAPDNC CRREPVLSRR KSKLSSVEEH 
KN..PITSPV VTPKKAQDNC CRREPLLVRR NQKLSSVEED 
iQI. .PITSPV VTQICCVPDSC CRRSPWVRN NQKrCSVEEE 
KN.,PITSPV VTQKXVPDNC CRREPMLVMI NQKCDSVEES 

KN..PIISPV AVQKKQI3SC CRRSPSQ.EK TVE^/STTESA 

MEGEEWNPG SDHSDASSVS TIGTLSSSPS TSDVRVIESM 

VTGDSWNSN STDDAQLKYY IMRVTLTV.-S ADHIVTLIIiX* 

RQ SGPV AIEA2G^SQTT PIDESKVE QEXD 

YEVTSIFHFI YNIFIT/INAN liWTLFLGLF LFKRIRIiNKP 
KN--PITSPV VT-KKAPDNC CRRSPLiiVRR --K-SSVEEE. -G 
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HMGRclu5CalW{methanobac} 



551 



EMGRclus talW{mechar.ococ } 
EMGRc lus t alW { halobac t er } 
KMGRclustalW{sulf Globus) 



HMGRclustalW{ yeast:2} liWQQKSATI EFSHT RSMPA SSGLST^V^:^ 

KDIIISSEIQ " 

EMGRclustalW{ yeast 1} TAPVQKAST? VLTN KT/IS GSKV7CSL«SA 

QSSSSGPSSS 

EMGRclustalW{phycomyces} 



EMGRclust:alW{ fusarium} LPIiGHYVPPT PMRTQ ^STPA ITDDEASG 

. .LHMTKARP 

HMGRclustalW{ Candida} LKKPKKXASX TAVSV PXAW V<DS^T'^'<SS 

EILHSSSESE * * 

HMGRclusCalW{dictyosts2} . . KGKSVNVS DLXDQ EIIAi VDKGSXQ- 

. - .IDILETRI* 



HMGRclustalW{wheatl} 



KTIGRclustalWf rice} . . . ^ . . .GCG GSR Ra a 

PPEPAPMHGH 

KMGRclustalW{ corn} SSGS AAA PSROFACA 

PAPCSLLGSP ' ' . ^ ^ 

HMGRclustalw{wheat3 } 



KMGRclustalW{ wheat2 } 
KMGRclustalw{ soybean} 



HMGRclustalW{rubbertre3 } PTG AC AAPSLDCS 

LSLPTXIHAP 

HMGRclust:alW{rosyperiwi} ..T..NCTTL GO AVPopS'^/x^ 

KIAPWPQQP ^ 

HMGRclustalW{ tomato} ..P...ATTLGC AVPAPP^U^ 

QIAPMAPPQP 

KMGRclustalW{woodtobacc} . . P. . .ATTL OCT . AVPPPPAL 

QIVPMVPPQP 

HMGRclust:alW{ potato} . . PCAAATTL GC vVP^ddt/r 

KIAPNT/PQQP 

EMGRcluscalwfradish} LVTC PF? poo s 

QIVAAKIiPN? 

HMGRclustalw{arabadcpsisl} LVTC SF? . ^rP 

IVSVAKLPNP 

EMGRclustalW{cucumismel} ZiXDN NRV AAPRSASA 

VALPSiO/^/DA 

EMGRclustalW{rubbertre2} 



HMGRclustalW{nibbertrel} LVTC PPA NISTKTTI 

lAAPTKLPTS 

HMGRclustalW{cantptothec} TVP CAA APVDCPLP 

PIKPXV^/DPV 

HMGRclustalW{arabadops2 } MIPC NQ SLDCREVL 

PIKPNSVDPP 

HMGRclustalWfchineseham} VIKPLWETE SAS RATFVLG.A .SGTSPPVAA 

RTQELEIEL? 
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FIG. 5ZX 



VIKPLWETE SAS J^ATFVLG.A .SGTSPPVT^ 

VXXPLVASTS STS RATrVLG.A .SGGCSPVAL 

VIKPLVAEAE TSG RATFVLG.A .SAASPPUU. 

VIKPLVAETD SPH RAATWGGS . SFPDTSLVL 

VIKPLVAETD TPN RAT^/VGNS . SLLDTSSVL 



HMGRcl us t al W { xencpus } 
EDraiTMFDIiP 

HMGRclustalW{sea urchin} 
VIK2EEVEWV 

HMGRclus t al W { cockroach } 
APSQPIiEEPP 

HMGRclus talW{drosophi.la} 
PLRHRL VGP I 

HMGRclustalW{dictyostel} 
QQTPDITNQP 

EMGRcluswalW{schistosom} 
KDDDYIDRND 

HMGRclustalw{archaeoglo} 



HMGRclus calw {pseudotnonas } 



Consensus VIXPLVASTE --S -RATFV-G-A -SA-P^»poA- - 

PPEIELP 



02061072A2 I > 



HMGRcl us t a 1 W { chine s eha2 } 
RTQEtiEIELP 

HMGRclus CalW { syrianhamst } 
GTQEPEISIiP 

HMGRclustalw( rat} 
GAQEPGISI.P 

HMGRclus talw{ rabbit} 
ETXSPEIELP 

EMGRclustalW{ human} 
VTQEPEIELP 
HMGRclus talw{ mouse} 



VIKPLPIiETS P KAKFIVG. . -DSSPl^EIiS? 

PVTASPRNSR SSSI^VSSKSM KPAR7TIGSS GSGSEDESEE 

ETRDEIiTTTR GMDG W VEVSSPVSHK YVQTEQPSCS 

LFTIEDQSSA N ASTQTDLI. 

SGK2Q EQ QQQQQQQQ 

PKIKSTLISD QVKQSPVLPK FSKKLMDIPL QSRKRiyCI*H 
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650 

HMGRc lus t: a 1 W { me ehanobac } 
. • . IMDDLME 

HMGRclus talw {methaiiococ } 
YNDILEKMLN 

KMGRclustalw{halobaccer} 
AASLADR^/RE 

HMGRclus talW{sulfolobus} »^ 



.MS. 
.MHN 
.MTD 



HMGRc lus talW{ 
NTEVSNIiWN 

HMGRclustalW{ 
NKEVAAIiVIH 

HMGRcl us t a 1 W { p hycomy c e s } 



yeas t2 } NNE . CVYALS SQDEPIRP . L SNLVELME . . . . KZQLXNMN 
yeastl} SEEDDSRDIS SLDKKIRP.L EELEALIiS. . . . SGNTKQLK 



.... PNRS .N EELEKIiLS . 



. ENALREMT 



HMGRclust:alW{ fusarium} ANL 

DEEVISLSMR 

HMGRclusCalW{ Candida} SEQ SSRP.L EQVIELYK. . . DGIC/iCT'LV 

DDEWSIiVTA 

HMGRclus talW{dictyoste2} PNN F QRAVKIRR. . . . ICLLARDLQ 

KEHQRAUiAQ 

KMGRclust:alW{wheat:l} 



rice} 



KMGRclustalW{ 
DEEIVAAVAS 

HMGRclust:alw{ corn} AA A 

DEEIVASWA 

HMGRclustalw{wheat3 } 



. GGMMEGD 
, PSKMPED 



HMGRclust:alw{wkeat2 } 
HMGRclus talw{ soybean) 



HMGRclustalw{rubbercre3 } I VSTTT. 

DEQIIKSWS 

HMGRclustalW{rosyperiwi} SX MV.I lEKPAPLI 

DEDIIKAWA 

HMGRclus talW{ tomato} S MS.M VEKPAPLI. 

DEEIIKSWQ 

HMGRclus talW{woodtobacc} S2CV AA.M SEKPAPLV 

DEEIIKSWQ 

HMGRclustalW{ potato} AXV ALS-Q TSKPSPII 

DEE HQS WQ 

HMGRclustalW{radish} E Qo^LPKS 

DEEIVKSVLD 

HMGRclus talw { arabadops is 1 } EP IV T^SLP^S 

DEErVKSVID 

HMGRc lus t al W { cucumi srae 1} EA LN TI ^LPE^ 

DEEWKMWD 

HMGRclustalw{irubbercre2 } 



-TSTIiSDO 
. TPQNSES 
. TSASSGE 
. TPAASEE 
- MPALSED 



HMGRclustalW{rubbertrel} EP . ..... .LI APLVSEE 

DEMIVNSWD 

HMGRclus talW{ camp Cothec} P I SPPSSEE 

DEEIIKSWE 

HMGRclus talw { arabadops 2 } RE SELDSVE 

DEEIVKLVID 

HMGRclustalw{chineseham} SE PRP.N EECLQILE. . SAEKGAKFLS 

DAEIIQLVNA 
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FIG. 521 



HMGRclusCalW{chineseha2} SE pro.n eeclqtle. SAEKGAXFts 

DAEIIQLVNA ^-^w-A^r. aAc.tt,A-t:7Iio 

EMGRclustalWfsyrianhamst} SE PRP.N EECLQILE. SAEKGAfC?T s ' 

DAElSSS^i"^"^""^ """^ ESCLQILE.. SASXGAiG-LS 

DiJ^SLiiJr''^"^ ''^''"^^ PR-°-N EECLQXLG.. KAEXGAKFI.S 

EMGRclustalW{ human} RE PIIP • H EECLQILG . . NASKGAXPLS 

DAEIIQLVNA w**^. . «a-,iv.«.a.^^ 

HMGRclustalW{ mouse} 

DA^^SL^^r^^""^ xenopus} EE prp.l DECVRILK. , NPDKGAQYLT 

DD^SLicr^^^^^^^ urchin} LET EI-KAPRP.M PELLEIL. . . NVGKGPNALT 

DC^^L^r^^"^ cockroach} AS NRS.I DECLSVC. . . KSDVGAQALS 

DBEIVSrVKA*'^^"^'^''°^°^^'^^^^ ^ ^^'"^ QECLDILNST EEGSGPAALS 

EMGRclustalw{dict:yostel} TKIN •p.-T-oTxr-r c 

NEEILIKLEK KXIPIKE^S 

DT^LQLLSH^^'^"^^*^^^^^^^^'"^ SSSVSTFSNT CKNSNERPSN VLDLDMLXEK IKQGLGHSLS 

HMGRclus talw { ar chaeoglo } 

• MQVLRLDRR 

HMGRclus calW{pseudoinonas} 

Consensus SE PRP-N EECLQII. -AEKGAKSLS 

DEEIIKLWA -"v^w A-j5.tjAi^ijS 
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651 

. .RTCGVK 
. .KrCVGIS 
. . SQSGAS 
• . KI VGVG 

yeast2} G. .KLPLYSL EXXLEDTTRA VLVRRKALST laAESPILVS. 



HMGRclustalW{meChanobac} GR,.IKLYEI E.RHVPVDEA VRIRREFIE. 
. . LEHVSNYS 

HMGRclustalW{meChanococ} GE.-IKPYQL D.KMFGSXIA TEIRHKFIE. 
. . FKHICNYS 

HMGRclustalW{halobacter} GD..r*RLHEL E.AHADADTA AEARRLLVE. 
..LDAVGNYG 

HMGRclust:alW{sulfolcbus} GE..ISrHSV D.NIiLEANAA MVARRLAIiE. 
. .LPSI6STV 
HMGRclustalW{ 
. . . EKLPFRN 

HMGRclust=alW{ yeastl} G. .KLPLYAL EXXLGDTTRA VAVRRKAI»SI LASAPVUVS 

. . . DRLPYKN 

HMGRcluscalW{phycomyces} 



HMGRclustalW{ fusarium} G.-KIPGYAL EKTLGDFTRA VKIRRSIIAR MKAAADITHS 
LDRSKLPYHN 

HMGRclustalW{ Candida} G. .KLPLYAL EKQLGDNIiRA VAIRRKAISD liADAPVIiRS . 
. . .NKLPYLH 

HMGRclustalW{dictiyost:e2} A. . WAAAEX AATSGEDPSS IQPWPPTSN LDFEGSLTN. 



. LPVDK 

HMGRclustalW{wheatl} 



rice} G..ALPSKRL ESRLGDCRPJV ARLPJIEALR R^/TGRG 



EMGRclusCalW{ 
-VEGLPFDG 

HMGRclustalW{ com) G. .KVPSYAL EARLGDCRRA AGIRRSALR RITGRD 

. lEGLPIiDG 

KMGRclustalW {wheats } 



HMGRclust:alW{ wheat2 } 
EMGRclustalw{ soybean} 



HMGRclust:alW{rubbert:re3} G. .SIPSYSL ESKLGNCXRA AIiIRRETLQ RMSGRS 

. .LEGLPLDG 

HMGRclustalW{rosyperiwi} G. .KIPSYSL ESKLGDCKRA AGIRREALQ RITGKS 

..LEGLPLEG 

HMGRclust:alW{ . tomato} G. .KIPSYSL ESKLGDCKRA ASXRKSVMQ RITGKS 

. .LEGLPLEG 

HMGRclustalW{woodtobacc} G. .KMPSYSL ESKLGDCKRA ASIRXSALQ RITGKS 

. .LEGLPLEG 

HMGRclustalW{ potato} G.-KTPSYSL ES KLGDCMRA AS IRKEALQ RITGKS 

. .LEGLPLEG 

HMGRclustalW{radish} G. .WPSYSL ESRLGDCKRA ASIRREALQ RLTGRS 

. . lEGLPLDG 

HMGRclustalW{arabadopsisl} G..VXPSYSL ESRLGDCKRA ASIRREALQ RVTGRS 

. . lEGLPLDG 

HMGRclustal'^{cucurnismel} G. .SVPSYSL ESKLGDPKRA ASIRREALQ RTTGRS 

. . IHGLPFEG 

HMGRclus t alW { rubbertre2 } 



. RMTRRS 



HMGRclus talW{rubbertrel} G. .KIPSYSL ESKLGDCKRA AAIRREALQ. , 
. .LEGLPVEG 

HMGRclus talW{ camp tothec} G. -TTPSYAL ESKLGDSHRA AAIRREALQ RMTKKS 

. .LAGLPLDG 

HMGRclus talW{arabadops2} G. .TIPSYSL ETKLGDCKRA AAIRREAVQ RITGKS 

. . LTGLPLEG 

HMGRclus talW{chineseham} K. .HIPAYKL ETLMETKERG VSIRRQLLST K-.LPEPSS. 
. .LQYLPYRD 
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FIG. 3ZBB 



^^|J^talW{chineseha2) K. .EIPAyKL ETLMSTHSRG VSIHRQLLST K. .LPEPSS. 
^J^J^^lW{syrianliamst} K. .HIPAYKL ETLMETKERG VSIRHQLLST K. .LPEPSS. 

LoSIpYRd"^*"^^'"^ K-.HIPAYKL ETLMETHERG VSIRRQIiSA K..LAEPSS. 

EMGRclustalW{ rabbit} K. .HIPAYXL ETLMETKSRG VSIRRQLLSK K. .LPEPSS. 

^QYLPYRD^^''^ ^uman} K, .HIPAYKL ETLMETKERG VSIK.RQt,LSK K..LSZPSS, 

EMGRclustalW{ mouse} 

HMGRclusCalWf xenopus} K. .HIPAZXL ETMM2SPRHG VAXRRQMLSD K L^OPca. 

HMGRclusCalW{sea urchin} K. .EIPAYKL ENILDNPERG VAVP^OQIISK L Lnr-mi 

EMGRclustalW{ cockroach} G. .HIAGYQL EKWRNPERG VGIRRQXLTK T Anr.'-rr,a 

HMGRclust:alW{drosophila} GGTHCPLHKI ESVUDDPERG VRIRRQUGS R. .AXMPVGR 

HMGRclustalW{dictyostel) G..EVLAYHL ENELGDCSRA VEIRRxNILLEX OLSXX 

HMGRclusCalWfschistosom} G..RLKTRSL ESWRNPFRA VELRRLDLS TFI^lNt^ 

HI lERXPYKD ^-wiji-a Tf-J^JIMP 

HMGRclustalW{archaeoglo} EYKSGKIRPj;. MSSRIPGFYK LSVEERIiXKV AS^AGLST^r-r- 
^SLnANAG^^"^^^^'^'^°"''^''"^ LDSRLPAFRN LSPAARLDHI GQLLGLSHDD 

Consensus G—-IPSYSL ESKLGDCXRA VSIRREALSK K--LRITGSS 
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701 

750 

vpStsotal^^^''^^"'^^^^''''^^''^ idmerasrrn ienpigwqx plgvagplrv rgzhadgeyy 

HMGRclusCalW{methanococ} IDEEMAMKKN ISNMIGAIQI PLGFAGPLKI NGEYAKGE^v 
IPLATT£GAL " 

HMGRcluscalW{halobacter} FPAEAAES.A ISJJMVGSIQV PMGVAGPVSV DGGSVAGEKY 

HMGRclustalW{sulfolobus} IDYSEIKNKN ASNVIGAIQI PLGIVGPtrv NGDYAKGDFv 
VPMATT£GAL 

HMGRclustalW{ yeast2} YDYDRVFGAC CENVIGYMPI PVGVIGPIiIX DGT SY^' 

I?MATT£GCL .... 

HiMGRclustalW( yeastl} YDYDRVFGAC CENVIGYMPL PVGVIGPLVI DGT SYH 
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FIG. 3ZDD 



IPMATT5GCL 

HMGaclusCalw{phycomyces } 

. PMATTEGCL 

EMGRclustalW{ fusarium} YNWERFFGAC CSNVIGYMPL PVGVAGPLVI DGQ....SYr 
IPMATT5GVI, 

HMGRclustalW{ Candida} YBYDPJ/FGAC CEJT/IGYMPIa PVGVAGPLII DGK....PYH 
IPMATT£GCL 

HMGRclustalW{dictyoste2} FDYTKVLGAC CENVIGYIPI PVGVAGPILL DGK. .,.LVS 
IPMATT£X3CL 

KMGRclus talW { wheat 1 } 



KMGaclustalW( rice} MDYQAILGQC CEiMPVGr«/QL PVGVAGPLLL DGR....EYH 

VPMATT£GCL 

EMGRclustalW{ com} FDYASILGQC CSLPVGYVQL PVGVAGPLIiL DGR . . . . RFY 
LPMATT£GCI- 

EMGRclus talW {wheat 3 } 



HMGRclustalw{wheat2 } 
HMGRclustalW{ soybean} 



KMGRclus talW{ruhbertre3} FDVHSILGQC CSMAIGYVQI PVGIAGPLLL DGX....EYT 
VPMATT=X2CIj 

EMGRclustalw{rosyperiwi} FDYASILGQC CHMPVGY^/QL PVGIAGPLLL DGR. . . .EYM 
LPMATT£GCr. 

KMGRclustalW{ tomato} F5TYSSXLGQC CEMPIGYVQI PVGIAGPLLL NGK EFS 

VPMATTEGCL 

HMGRclustalW{woodtobacc} FDY5SILGQC CEMPIGYVQI PVGIAGPLLL DGR....EYS 
VPMATT£GCL 

HMGRclustalW{ potato} FDYSSILGQC CEMPVGYVQI PVGIAGPLLL DGR....EYS 

VPMATT£GCL 

EMGRclus talW{ radish} FDYDSILGQC CEMPVGYIQI PVGIAGPLLL DGY....EYS 
VPMATTEGCL 

KMGRclustalW{arabadcpsisl} FDYE3ILGQC CEMPVGYIQI PVGIAGPLLL DGY....EYS 
VPMATTEGCL 

HMGRclustalW{cucumismel} FDYESILGQC CEMPVGYVQI PVGIAGPLLL DGF EYT 

VPMATTEGCL 

HMGRclus talW { rubbertre2 } 



KMGRclus talw{rubbertrel} FDYESII.GQC CEMPVGYVQI PVGIAGPLLL NGR....EYS 
VPMATTEGCL 

HMGRclus talW{camptothec} FDYDSILGQC CEMPVGYVQI PVGIAGPLLL DGR..-,EYS 
VPMATTEGCL 

HMGRclus talW{arabadops2} FDYNSILGQC CEMPVGYVQI PVGIAGPLLL DGV....EYS 
VPMATTEGCL 

KMGRclus talW{chineseham} YNYSLVMGAC CEN^/IGYMPI PVGVAGPLCL DGK EYQ 

VPMATTEGCL 

HMGRclustalW{chineseha2} YNYSLVMGAC CENVIGYMPI PVGVAGPLCL DGK EYQ 

VPMATTEGCL 

HMGRclus tal5^{syrianhamst} YNYSLVMGAC CEJT/IGYMPI PVGVAGPLCL DGK EYQ 

VPMATTEGCL 

HMGRclus talW{ rat} YNYSLVMGAC CENVIGYMPI PVGVAGPLCL DGK EYQ 

VPMATTEGCL 

HMGRclustalW{ rabbit} YNYSLVLGAC CENVIGYMPI PVGWGPLCL DGK EFQ 

VPMATT£GCL 

KMGRclus talW{ human} YNYSLVMGAC CENVIGYMPI PVGVAGPLCL DEK EFQ 

VPMATT2GCL 

HMGRclus talW{ mouse} 



EMGRclus talW{ xsnopus} YNYSLVMGAC CENVIGYMPI PVGVAGPLLL NNK EYQ 
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FIG. 52 EE 



VPMATTSSCI* 












HMGRclustalW{sea urchin} 


YDYSFVSGAC 


CENVIGYMPV 


PVGVAGPMiL 


DGQ. 


. . .EFQ 


VPMATTBSCLi 








HMGRclust:alW{ cockroach} 


YDYIiKVMGAC 


CENVIGYMPV 


PVGVAGPLm. 


DGR. . 


. . .LVH 


VPLATT£GCIj 










HMGRclustalW{drosophila} 


FDYRKVLNAC 


CSSTVLGYVPI 


PVGYAGPIiLL 


DGE. . 


. . .TYY 


VPMATTH3AL 








HMGRclus talW { dictyos tel } 


FDFAKVQGQC 


CEJJVIGYVPI 


PVGTAGPIQL 


NGQ. . 


. .LVT 


IPMATT£GCI* 






HMGRclustalW{schlsCosoin} 


YDYRLVYGQC 


CEEVrGWIPI 


PVGKIGPLLL 


DGR. . 


. . SHY 


IPLATTEGCL 










HMGRclus talW { archaeoglo } 


. LPIoDVADRM 


lEJIVIGTFEL 


PLGIATMFLI 


DGK. . 


. .DYL 


IPMAIBCTSV 












HMGRclus talW{pseudomonas } 


ALPMDIANGM 


lENTVIGTFEL 


PYAVASJTFQI 


NGR. . 


. . DVI, \ 


VPLWZ£PSI 









Consensus FDY-SVLG-C CENVIGY--I PVGVAGPLLL DGK SYS 

VPMATTTOCL. 



HMGCoA binding 

E 



I 
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HMGRclustalW{methanobac} VASVNRGCSV ITHAGGATVR VTGDSMT.RA PVIRtoswh 
AliQLREWXYE 

HMGRcluscalW{methanococ} vaSVNRGCSI ITKCGGATVR VIDDKMT.RA PCLKTKSVs/D 
AXKVRDWIRS 

HMGRclustalW{halobacCer} LASVNRGCSV INSAGGATAR VLKSGMT-RA P^/FRVADVA2 
ASALVSWTRD 

KMGRclustalWfsulf Globus} lASVNRGIKA VTLSGGVRAK VLKDEMT.RA PVFKFDSXEQ 
IPNFLKFXHE 

HMGRcluscalW{ yeast2} VASAMRGCXA INAGGGATTV LTKDGMT.RG P'^/VRFPTLIR 

SGACKXWIiDS 

HMGRclustalW{ yeastl} VA5AMRGCKA INAGGGATTV LTKDGMT.RG PV^/RFPTLKR 

SGACXIWUDS 

HMGRclustalW{ahyccmyces} VASTARGCKA INAGGGASTI VIADGMT.RG PCVSFPTILR 
AA^VCKLWIEN 

EMGRclustalW{ fusarium} VA5ASRGCKA INSGGGAXTV LTADGMT.RG PCVAFSTLSR * 
AGAAICLWLDS 

KMGRclustalW{ Candida} VASAMRGCKA INIiGGGVTTV LTKDGMT.RG PCVKFPSLKR 
AGQCKLVniiDS 

HMGRclustalW{dictyost:e2} VASTHRGAKA ITKSGGAKTV LLQSGMT.RA PVCRX.PSSIR 
AGHLKQWISN 

HMGRclus t al W { whe a 1 1 } 



EMGRclustalW{ rice} 
AAHLIC^AEA 

HMGRclus talw{ coini} 
AA£L1C;VFL£D 

HMGRclus t:alW{ wheat 3 } 



HMGRclus talw{wheat:2} 
HMGRclus talW{ soybean} 



VASVNRRVQG 


HLVSGGAFSV 


LLRDAMS 


.RA 


PAVKLPCPMR 


VASTNRGCKA 


IA2SGGATSV 


VLRDAMT 


.RA 


P^/ARFPTARR 







EMGRclusi:alW{rubbertre3} VASANRGCKA lYASGGATSV LLRDGMT.RA PWRF^TAXH 
AADLKFFMED 

EMGRclustalW{rosyperiwi} VASTNRGCKA ILASGGANSV LLRTCMT-RA PWRFGTAKR 
AAELKFYMED 

HMGRclus talW{ tomato} VASTNRGCKA lYASGGATCI LLRCGMT.RA PCVRFGTAXR 

AAHLKFFV2D 

HMGRclus talW{woodtobacc} VASTNRGCKA lYASGGATSV LLRDGMT.RA PCVRFGTAKR 
AAELKFFVEIP 

HMGRclustalW{ potato} VASTNRGCKA IFVSGGADSV LLRDGMT.RA PVV"RFTTAKR 

AAELKFFVED 

HMGRclustalW{radish} VASTNRGCKA MYVSGGATST VLKDGMT.RA P^/VRFASARR 
ASELKFFLES 

HMGRclustalW{arabadopsisl} VASTNRGCKA MFISGGATST VLKDGMT.RA PWRFASARR 
ASELKFFLEN 

HMGRclus talW{cucumismel} VASTNRGCKA lYASGGATSM LLKDGMT.RA PWRFGSAKR 
ASELKFFLED 

EMGRclustalw{rubbertre2 } 



HMGRclus talW{rubbert re 1} VASTNRGCKA lYLSGGATSV LLKDGMT.RA PWRFASATR 
AAELKFFLED 

HMGRclus CalW{ camp tothec} VASTNRGCKA IFACGGATSV LLRDAMT.RA PWRFGSAKR 
AADLKFFLEN 

HMGRclus talW{arabadops 2} VASTNRGFKA IHLSGGAFSV LVKDAMT.RA PWRFPSARR 
AALVMFYLQD 

HMGRclus talW{chineseham} VASTNRGCRA IGLGGGASSR VLADGMT.RG PWRLPRACD 
SAEVKAWLET 
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EMGRclustalW{chineseha2} VASTNRGCRA IGLGGGASSR VLADGMT.RG PVVRIiP!?ACD 
SAEVKA5^ET 

HMGRclustalW{syrianhamsc} VASTNRGCHA IGLGGGASSR VLADGMT.RG PWRLPRACD 
SAEVKAWLET 

HMGRcluStalW{ rat} VASTMRGCRA ISLGGGASSR VLADGMS . RG PWRiPRACD 

SAEVKSWLET 

HMGRcluscalW{ rabbit} VAST25RGCRA IGLGGGASSR VLADGMT.RG PWRLPRACD 

SAEVKAWLST 

HMGRclustalW{ human} VASTNRGCRA IGLGGGASSR VLADGMT.RG PWRLPRACD 

SAEVKAWLET 

HMGRclustalw{ mouse} , 



HMGRclustalW{ xenopus} VASTNRGCRA IMLGGGAKSR VLADGMT.RG P^/^/?j;,PTACD 
AAEVKAWLDS 

HMGRclustalW{sea urchin} VASTNRGCRA LRSAGGIHSV LIGDGMT.RG PL^/HLPSAQE 
AGAXKQWLEV 

HMGRclustalw{ cockroach} VASTNRGMRA LMRCG . VTSR IVADGMT . RG PWRFPNIDR 
AS.EAMLWMQV 

HMGRclustalW{drosophila} VASTNRGCXA LSVRG.VRSV VEDVGMT . RA PO/Rr?SVAR 
AAEAXSWIEN 

HMGRclustalW{dictyosCel} VASTHRGCKA ITESGGAXCT ITSRGMT.RA PWRFSDIVK 
ASEFVSWIND 

HMGRclustalW{schistOSCm} VASTNRGCRA IrLAGGIXSV VYRDQMT . RA PWWFPSXID 
SVKCIAWIDS 

EMGRclustalW{archaeoglo} VAAASNAAR.M ARSSGGFTTD YTGSLMIGQI QVTKLLNPNA 
AKFEVLRQKD 

HMGRclustalW{pseudcmcnas} VAAASYMAXu ARANGGFTTS SSAPLMKAQV QIVGIQDPLN 
ARLSLLRRKD 

Consensus VAST^IRGCXA I-LSGGATSV VLADGMT-RA PWRFPSAKR 

AAELXFWLED 



02061 072A2J.> 



wo 02/061072 



PCT/US02/00255 



65/78 



FI&. 5ZHH 



850 



801 



IVAGSYVYPR 



yeast2} EEGQNSIKiC^ FNSTSRFAiU, QEIQT CliAGDLiFMR 

yeastl} EEGQNAIKKA FNSTSRFARL QEIQT CI*AGDLLrMR 



HMGRclustalW{methaiiobac} NM. .DAIiRSE AZSTT3HGKL VKXDdi 
FVYTTGDSMG 

HMGRclustalWfmethanococ} NF . . ERIXSV AESTTRKGKL IKI2PI.. LIVGPNLY^^^ 
FVFKTGDAMG 

HMGRclustalM{halobacter} NF. .AALKSA AEETTMKGEL UDVTP . . . YVVGNSWrR 
FRYDTKDAMG 

HMGRclustalW{sulfolcbus} NL . . EKIRNI ANSTSEKGKL KSITP... FVLG^INVWLR 
FSFETGDAMG 

KMGRclustalW{ 
FRTTTGDAMG 

HMGRclusCalW{ 
FRTTTGDAMG 

HMGRclustal^/^fphyconnyces} EG. NDIVTNA FNSTSRFARi RXLKI AiiAGrCLVI^ ■ 

FSTTTGDAMG " ' \ 

EMG Rclustal^{ fusarium} EAGQDMMKJCA FNSTSRFARi QSMKT ALAGt-stLYIR 

FKTTTGDAMG 

HMGRcluscalW{ Candida} DEGQESMiGCA FNSTSRFARi QELQT. . . . ALAGDii^^Tt^ 
FRTVTGDAiMG 

HMGRclustalW{dict:yoste2} QENFYQVASA FNSTSRFARi XSIW 
FKSSTGDAMG 

HMGRclust:alw{wheatl} 

GDAMG 

HMGRclustalW{ rice} PANFEilAAV FNRSSRFGRi QDIRC. . . AiAGP'^TiVMR 

FSCITGDAiMG ^ 

EMGRclustalWf corn} PANFDTLSW FNRSSRFARi QGVQC. . . . AMAGR^Ii'-T^R 
FSCSTGDAiMG 

HMGRclustalW{wheat3 } 

GDAMG 

H>!GRclust:alw{wheat:2} 

GDAMG 

HMGRclust:alW{ soybean} 



VXAGRiV-YLR 



AIAGKNiYMR 



HMGRclusCalw{mbbertre3} PDNFDTIAW FNKSSRFARi QSVQC 
FSCSTGDAMG 

HMGRcluscalW{rosyperiwi} TQNFETISW FNKSSRFAKL QSVQC... AJAGKNiviR 
FSCSTGDAMG 

HMGRclustalW{ tomaCo} PIKFSSLANV FNQSSRFARi QRXQC AXAGKNLYMR 

LCCSTGDAMG 

HMGRclusCalW{woodtobacc} PVKFETLAAV FNQSSRFARi QRIQC. . . . AiAGKNiv>iR 
FVCSTGDAMG 

HMGRclustalW{ potato} PiNFETLSLM FNKSSRFARi QGIQC AJAGiCNiVTT 

FSCSTGDAMG 

EMGRclast:alW{radish} PENFETLAW FNRSSRFARi QSVMC. . TiAGKNAvvR 
FSCSTGDAMG ' ... 

HMGRclusC3lW{arabadcpsisl} PEJIFDTLAW FNRSSRFARi QSVKC. . . TIAGKNAYVR 
FCCSTGDAMG 

HMGRclustalW{cuc'Uinismel} PSNFDTLAW FNRSSPJARi QSIRC. SIAGKNiYVR 
FCCSTGDAMG 

HMGRclustalW{ 2nibcertre2 } 



HMGRcluscalW{3rubbert:rel} PDNFDTLAW FNKSSRFARi QGIKC SIAGKNiYIR 

FSCSTGDAMG 

HMGRclustalW{campCothec} PiNFETiAAV FNSSSRFGKi QNIKC AIAGKNiYMR 

YSCSTGDAMG 

fiMGRclustalW{arabadops2} PSNFERiSLI FNKSSRFARi QSITC TIAGRNiYPR 

FACSTGDAMG 

HMGRclu5ralW{chineseham} PEGFAVTKDA FDSTSRFARi QKiKV TMAGSNiYIR 

FQSKTGDAMG 
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HMGRc lustalWf chines eha2} PEGFAVIKDA FDSTSRFAiH, QKIJIV TMAGRNIiYT^ 

FQSKTGDAMG 

HMGRclustalW[syrianhamst} PEGFAVIKDA FDSTSRFARL QKLHV TMAGSiNLviR 

FQSKTGDAMG 

HMGRclustalW{ rac} PEGFAWXSA FDSTSRFARIi QKLHV TLAGRNLY"R 

LQSKTGDAMG 

HMGRclust:alW{ rabbit} PSGFAVIKSA FDSTSRFARL QKLHI . SMAGR^TLYIR 

FQSRTGDAMG 

HMGRclustalWf human} SEGFAVIKZA FDSTSRFARL QKLKT SIAGRNLVXR 

FQSRSGDAMG 

HMGRclustalW{ mouse} 



HMGRclustalWf xenopus} A2GFK77XKDA FDSTSRFARL GRLQN CVAGRNLYIR 

FQSKTGDAMG 

HMGRclusCalWfsea urchin} PENFAAIKSR FESTSRFAKL KSIQT ALAGRYMFL? 

FXALTGDAMG 

HMGRclustalW{ cockroach} PlfNFSQIKKN FDSTSRFARL SKIKI RVAGRKLFIR 

FIATTGDAMG 

HMGRclustalW{drosophila} DENYR^/VKTH FDSTSRFGRL KDCHI AMDGPQL^IR 

FVAITGDRMG 

EMGRclustalW{dicCyostel} TDNYQALKAV FDSTSRFARL SAIKC. . . . . TI^VGRSV^TTi 
FKCDTGDAMG 

HMGRclusCalW{schist:osom} EEGFQTLKSA FDKTSAHVNL LSVFA CPAGRYIH"R 

FAARTGDAMG 



HMGRclusCalW{archaeoglo} EIIERAJ^JECD PMLVNLGGGC KDIEAR.VID TIMGK3'tL'»'VK 
LIVDVKDAMG 

HMGRclustalW{pseudomonas} EIIELANHXD QLLNSLGGGC RDISVHTFAD TPRGPMLVAr 
LIVDVRDAMG 



Consensus PENFHTLK-A FNSTSRFARL QSIQC AIAGRNLYIR 

FSCSTGDAMG 



NADE binding domain 1 
(continued) * 
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HMGRclusCalw{mechanobac} MNMVTIATHR ALELLT. . .R 
PAAVNLIEGR 

HMGRclust:alw{cnethanococ} MNM^/TIATEX ACNFIEGELK 
PSGiMNLINGR 

HMGRclusCalw{halobaccer} MNMATIATEA VCGWE. . .A 
PAAINAVEGR 

HMGRclustalW{sulf Globus} MNMVTIAV2X VCEFIS . 

QTNVNSLFGR 

HMGRclusCalW{ yeasC2} MNMISKGVEY SLKQMVEEY. 

FAAXNWIEGR 

HMGRclustalW{ yeastl} MNMISKGVEY SLKQMVEEY. 

PAAINWIEGR 

HMGRclus tal W {phyccmyces } MNM 



ETGASV..IA LSGNLCTDKK 
KEGIFVKTVA VSGNACVDKK 
ETAASL..VA LSGNLCSDKK 
ENFPSADCIiA VSGNMCSDKK 
-GWEDMZWS VSGNYCTDKK 
.GWEDMEWS VSGNYCTDKK 



HMGRclusCalW{ fusarium} MNMISKGVEK ALSVMAIIDG . .GFDDMQIIS VSG^r/CTDKK 
AAALNWIDGR 

EMGRclusCalW{ Candida} MNMXSKGVHY AIiKQMTEVF. .GWDDMi«n/VS VSGNYCTDKK 
PAA^/NWINGR 

HMGRclust:alW{dicryost:e2} MNM^/SKGVEK AIiEVITEY- . ..FPEMEVLS LSGNVCTDKK 
PSS INWLEGR 

HMGRcluscalW{wheatl} MtJMVSKGVEN VI.GYIRNN. . . . F?D^EDVrs ISG^r/CSDKK 
ATAVNWIDGR 

KMGRclustalW{ rice} MNMVSKGVEN VLGYIiQNV. . . -FPDMDVXS VSGNYCSDKK 

PTAVNWIEGR 

HMGRclust:alW{ com} MN^fVSKGVQN Vt*DFLQDD- - - .FHD^^DVIS ISGNFCSDBOC 
PSAVNWIEGR 

HMGRclus talW{wheat:3} MNMISKGVQN VLDYLQDD. . ..FPDMDVXS ISGNFCSDKK 
PAAVNWIEGR 

HMGRclustalW{wheat:2} MNMISXGVQK VLDYLEED . . . . FPDMDWS ISGNFCSDKK 
SAAVNWIEGR 

HMGRclus taiw{ scybean} 



HMGRclustalW{irubbsrt:re3} M^TM^/SXAVQN VXDYLQND . . . 
AAAVNWIEGR 

HMGRclustalW{rosyperiwi} MNTTVSKGVQN VLEFLQTD . . . 
PAA'/NWIEGR 

EMGRclustalW{ tomato} MN>rVSKGVCN VLDYLQNE . . . 

PAAVNWIEGR 

HMGRclastalW{wccdrobacc} MNMVSKGVQN VLDYLQNE . . . 
PAAVNWIEGR 

HMGRclustalW{ potato} MNMVSKGVQN VLDYLQSS . . . 
PAAVNWIEGR 

HMGRclusCalW{ radish} MNMVSKGVQN VLSFLTED . . . 
PAAVNWIEGR 

HMGRclustalW{arajbaccpsisl} MNM^/SKGVQN VLEYLTDD . . . 
PAAVNWIEGR 

HMGRclustalW{cucumismel} MNMVSKGVQN VLEFLQED. . . 
PAAVNWIEGR 

HMGRclus tal W{rubbertre2} LESD. . . 

PAAVNWIEGR 

HMGRclustalW{rubbercrel} MNMVSKGVQN VLSFLQSD . . . 
PAAVNWIEGR 

HMGRclustalW{camptothec} MNMISKGVQN VLDFLQDD. . . 
PAAVNWIEGR 

EMGRclustalW{arabadops2} MNMVSKGVQN VLDFVKSE. . . 
ASAVNWIEGR 

HMGRclus talW{chir:eseham} MNMISKGTEK ALLKLQEF. . . 
PAAINWIEGR 



. . FPDf^DVXG LTGNFCADKK 
. .YPDMDVLG ISGNFCADKK 
. YPDMDVIG ISGNFCSDKK 
-YPDMDVIG ISCTTFCSDKK 
.YPDMDVIG ISGNFCSDKK 
.FPDMDVIG ISGNFCSDKK 
.FFDMDVIG ISGNFCSDKK 
.FSDMEVIG ISGNFCADKK 
.FADMDVIG ISGNFCSDKK 
.FSDMDVIG ISGNFCSDKK 
.FPDMDVIG ISGNYCSDKK 
.FPDMDVIG ISGNYCSDKK 
.FPEMQILA VSGNYCTDKK 
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HMGRclust:alW{chineseha2 } 
PAAINWIEGR 

EMGRc 1 US ta 1 W { syr i anhams t } 
PAAVNWIEGR 

HMGRclu3talW{ rat} 
PAAINWIEGR 

HMGRclustalW{ rabbit} 
PAAV17WISGR 

HMGRclus t al W { human } 

PAAINWIEGR 

EMGRc lustalw{ mouse} 
PAAINWIEGR 

HMGRclus talW { xsnopus } 
PAAINWIEGR 

HMGRclustalW{sea urchin} 
VAAXNWISGR 

HMGRclus talW{ cockroach} 
PAAVNWIEGR 

HMGRclus tal W { drosophi la } 
PAAINWIKGR 

HMGRclustalw{dictyostel} 
PSSINWTEGR 

HMGRclus calW { schis tosom} 
PATINTILGR 

HMGRclus tal W { archaeoglo } 
KAVFDKDVIG 

HMGRclustalW{pseudomonas} 
QVRITPQQLS 



MNMISKGTEK ALIiKLQEF. . 

MNMISKGTEK ALVKLQEF. . 

MNMISKGTEK ALLKLQEG. . 

MNMISKGTEK ALSKLHEY. . 

MNMISKGTEK ALSKLHEY . . 

EK ALLKLQEF. . 

MNMISKVTSQ AIiARLQEE. . 
MNMISKGTEQ ALHALQTM. . 
MNMIiSKGTEV ALAYVQQV. . 
MNMVSKAXiRW PFASrTI^H. . 
MNM^/SKGVEA VLSKLKII . . 
MNMVSKATDS ALKCLKKY. . 
ANAVNTMCSK VAPFIERITG 
ANTVNTMAEA VAPLMEAITG 



..FPEMQILA VSGNYCTDKX 
. . FPEMQILA VSGNYCTDKK 
. .VPSLQILA VSGNYCTDKK 
- . FPEMQILA VSGNYCTDKK 
. . FPEMQILA VSGNYCTDKK 
. . FPDMQILA VSGjrYCTDKK 
. . FPDLE^/LA VSGNYCTDIGC 
. . FPNIEIMS LSGNYCTDKK 
..YPDMEILS LSGNFCTDrGC 
..FPDMQIIS LSGNFCCDKK 
. . FDDMTLLS ISGNMCTDKK 
. . FSNMQVIS LSGNMCTDKK 
.GKVYLRIIS NLAAYRIiARA 
-GQVRLRILS NLADLRLAPJ^ 



PAAVNWIEGR 



Consensus MNMVSKGVEN VL--LQED-- -GFPDMDVIS ISGNYCTDKK 



NADH binding domain 1 (concluded) 
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HMGRclustalWfmethancbac} GXSITAEITV PGEMVESVIiK TTPEAWSVN TAiCVLIGSAA 
AG . . SMG . FN 

HMGRclust:alW{metlianococ} CKSIVAEVFL TSKSVNKYLK TTSCAIAST^N RLiGTVTGSA-^ 
SN..SMG.FN 

HMGRclustalW{halobacter} GRSVTADVRI PREVVSERI*K TTPERGRHLN TRiGVLVGSAK 
AA- .SLG.FN 

HMGRclust:alW{sulfolobus} GKTVIAEAI*! KKDVIRNIIJi SNAQLIHDIN IJiiOiWLGTAR 
AG . . SLSQFN 

HMGRclustalW{ yeast2} GXSWAEATI PGDWKSVLX SDVSALVSLN ISiO/"LVGSA;-l 

AG . . S VGGFN 

EMGRclusCalW{ yeasCl} GKSWAEATI PGDVVR3C/LX SDVSALVELN lAiO^LVGSAl^ 

AG,. S VGGFN 

HMGRclust:alW{phycofnyces} 



HMGRclustalW{ fusarium} GXGWAEAII PGEWRSVLK SD^/DSLVHIiN VAiCWLIGSA?-' 
AG..SVGGFN 

EMGRclustalW{ Candida} GK3WAZASX PKDAWlC/IiK SS'ri:A^r7D\P<i VN^JL^GSAM 
AG . , SVGGFN 

HMGRclust:alW{dict:yost:s2} GKSWAHAVI SGDIV"RDVLK TTVZALV^^LN rDJO;XTGSA.M 
AG..SIGGFN 

EMGRclustalW{wheatl} GX3WCEATI KGRWQSVID TT^/SXLVELN II^OiLAGSAV 
AG. .AliGGFN 

EMGRclust:alw{ rice} GXSWCEAXI KGDWQKVLK TTV2KI*VSLN IliOilAGSAV 

AG. .ALGGF^I 

HMGRcluscalW{ corn} GKSWCSAVX GSSWKI-CVLX TaVQSnvSLN TISG^JLAGSAV 
AG..AI.GGFN 

HMGRclustalW{ wheats} GXSWCEAVX RSSLLKI-C/LK TNVQSLVSIiN VXXMAGSAV 
AG . . ALGGFN 

HI>!GRclustalW{wheaC2} GXSWCEAXI RS2WSXVLD T23VQSLVHI:N VXXNLAGS^V 
AG. .ALGGFN 

HMGRclust:alW{ soybean} LX TNVSAiVHLN MLXTJLAGSAV 

AG . . AX.GGFN 

HMGRclust:alW{rul3bertre3} GXSWCEAXI XESWXXVLX TNVAALVSLN MIJOrtiTGSAV 
AG . , SLGGFN 

HMGRclusCalW{rosyperiwi} GXSWCEAII XEEIVKTVLX TEVAALISIiN MViGtfLA<3SAT 
ACAIiGGFN 

HMGRclust:alW{ tomato} GXSWCEAXI TEEWXXVLX TF«/AALVSLN MLiO^LTGSAM 
AG. .ALGGFN 

HMGRclustalW{woodtobacc} GXSWCEAXI TESWKXVIaX TSVAALVELN MLiOiLTGSAiM 
AG. .ALGGFN 

HMGRclusCalW{ potato} GXSWCEAXI XZH^/VXXVLX TZ'/AALVELxV MLiCvn^TGSAM 

AG . . ALGGFN 

HMGRclustalW{radish} GKSWCEAVI RCZT\n>nC^rLK TSVASL^/ELN MLXIVLTGSA" 
AG . . SLGGFN 

EMGRclustalw{arabadopsisl} GXSWCSAVI RGSXVNXVLX TSVAALVELN MLiCWLAGSAV 
AG . . SLGGFN 

HMGRclustalW{cucumismel} GXSWCEAVX XDEWRXVLX TSVASLVELN MLJCVLTGSAM 
AG. .ALGGFN 

EMGRclustalW{rubbert:re2} GXSWCEAXI KEEWXXVLX TDVALLVELN MLiCS«LAGSAV 
AG. .ALGGFN 

KMGRclustalWfnibbertrel} GXSWCEAXI KEEWXXVLX TNVASLVSLN MLiCWLAGSAV 
AG. .ALGGFN 

HMGRclustalW{camptothec} GXSWCZAVI XESWXXVLX TNVASLVELN MLiCZtfLTGSAM 
AG. .ALGGFN 

HMGRclustalW{arabadops2} GXHWCEAFI XASIVEX'/LX TSVEALVELN TLXZyLVGSAM 
AG . . SLGGFN 

HMGRclus talW{chineseham} GXTWCEAVI PAXWREVLX TTTEAMXDVN XNK2yffiiVGSAM 
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AG..SIGGVN 

EMGRclustalW{ciiiiieseha2} GKTWCEA^/I PAKWRSVLK TTTEAMIDVN INJd^VGSAM 
AG. .SXGGYN 

HMGRclust:alW{syrianhamst} CKTWCEAVI PARWRSVLK TTTEAMIPVN INJOttiVGSAM 
AG. .SXGGYlf 

HMGRclustalW{ rat} GKTWCEAVI PAKWRSVLK TTTEAMVDVN I^TXZtfLVGSAf^ 

AG. .SIGGYN * 

HMGRclust:alW{ rabbit} GKTWCEAVI PAKWREVLK TTTEAMIDVN IJTiGWLVGSAiM 
AG. .SIGGYN 

HMGRclustalW{ human} GKSWCEAVI PAXWR3VLK TTTEAMIEVN INiGVLVGSAM 
AG. . SIGGYN 

HMGRclustalW{ mouse} GICn/VCEAVI PAKWRSVLK TTTEAMVDVN INi<]VLVGSAiM 
AG. • SIGGYN 

HMGHclustalW{ xenopus} GXSWCSAII PAKWRSVLK SSTEAiVEVN IN5GVFrGCA.v 

AG.. SIGGYN ^^^^^ 

HMGRclusCalW{sea urchin} GKSWCHA'r/ PAHIVQQVLK TSASALVDLN r-^^JLVGSAM 
AG. .SIGGFN 

HMGRclustalW{ cockroach} GKSWCEAIV PADIIXSVLK TSVQALxMDVN ITiCMLIGSW 
AG . . S IGGFN ^Y-ix^^.-L 

HMGRclustalW{drosophila} GKRWTECTI SAATLRSVLK TDAXTLVECN KLJOJMGGSAM 
AG . . S IGGNN 

HMGRclustalW{dictycstel} GRSWCEAMI TGDWQRVLK TNVQALVDLN lAiOWLIGSAM 
AG. .SIGGFN 

HMGRclust:alW{schistoscra} GXSVIAZAKL SADVLAQVXJi TNAQRUU?I.T HSKM^^gsat^ 
AGCPGMMGCN 

HMGRclustalW{archaecglo}. GEEW EGIMLAYAPA AADPFRCATH NiCGIMNGISA 

LM 

HMGRclustalW{pseudomonas} TAEFSGEAVI EGILDAYAFA AVDPYRAATH NiCGIMNGIDP 
Z«I 

Consensus GKSWCEAVI PAE^/VRiC/IiK TTVEALVELN ILiCNLVGSAM aG- 

5LGGFN 
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KMGRclust:alW{methanobac} AHYANIIGAX FLATGQi^EAK IVEGSLGyTI AE2RK, 

HMGRclustalWfmethanococ} AKYANIIGAI FLATGQDEAK IVSGSLGITM AEVED. 
• . . .DGLYr* 

HMGRclusCalW{halobacter} AKVANWAAM FLATGQDEAQ WEGAMAttt ;icVod 
. . . .GDLYV — V . 

HMGRclust:alW{sulf Globus} AKFANIVTAI FIATGQIVAQ IVESSSG^^W T^'/^G 
-EDLYI " ' 



HMGRclustalW{ yeast2} AHAANLVTAL FLALGQJDPAQ N^/HSSNCITL MXHVD 

.GDLRJ 



EMGRclust:alW{ yeastl} AHAANLVTAV FLALGQJDPAQ JT/iSSNCXTI, MKSVD 

.GDLRJ 



KMGRclusCalW{phycomyces } 



HMGRclustalW{ fusarium} AHAANIVAAI FLATGQCPAQ V^/SSA^ICITI MXNLN 
-GALQI 



HMGRclustalW{ Candida} AQAANMVTAV YIALGQDPAQ N^/ZSSNCITL MT^T^^ 
.GDLKV - — . 



HMGRcluscalW{dict:yoste2} AKASNIVTAXi YIATGQDPAQ JT.-HSSNCITL MESIMGG 
. . . . KDLYI 

EMGRclustalW{wheatl} AHASNIATAL FIATGQ£>?AQ NVSSSQCITM LHAVNSG 
. KDLHI 



HMGRclustalW{ rice} AKASNIVTAi FIATGQ£?PAQ NVSSSQCITxM LEEVTSTOG 
. . . .DDLHI 

HMGRclustalW{ corn} AHASNIVTAX FIATGQI?PAQ IT/ESSKCITM LEPVNAG. 
» • « > RJDLxiX 



HMGRclustalW{wheat:3} AHASNIVTAI FIATGQrPAQ NVESSQCZAM LEAVNCG 
KDIiHI 

EMGRclustalW{wheaC2} AKASNIVSAX FXATGQX/PAQ STvZSSQCXTM LEAV>iGG 
RDLHX 

HMGRcluscalW{ soybean} AHASNXVSAX FIATGQDPAQ NVESSKCXTM MEAXNCG 

HMGRclustalW{riibbertre3} AHASNMVTAV YIATGQDPAQ NVESSKCXTM MEAVNDG. , 
• . . . . ICDLHX 

EMGRclustalW{rosy?eriwi} AKASNIVSAX FIATGQOPAQ NVESSQCXTM MEA^/NDG. . 
..... i^LHX 

HMGRclustalW{ tomato} AKASNIVSAV FXATGQOTAQ NXSSSECXTM MEAVNDG. . 
KDLKI 

HMGRcluscaIW{wocdtobacc} AHASNIVSAV YXATGQOTAQ NIESSKCXTM MEAVNDG 
KDLHV 

HMGRcluscalW{ potato} AHASNIVSAV YLATGQDPAQ N^/ESSKCITM MEAVNDG 

HMGRclustalW{ radish} RHASNIVS.V/ FLATGQ£?PAQ NVESSCCITM MEAINDG 
KDIHI — ' 

HMGRclustalW{arabadcpsisl} AHASNIVSAV FXATGQ15PAQ N^/ESSQCXTM MEATndg 
KDIHI ifi^.x>iwi^.. 

HMGRcluscalW{cucumismel} AHSSNIVSAI FIATGQOTAQ NVSSSHCITM MEP^/NNG 
RDLHI 

HMGRclustalW{rubbertre2} AKAGNIVSAX FIATGQOTAQ NVESSKCITM MEAVNDG 
KDLHI 

HMGRclustalWfrubbertrel} AHAGNIVSAI FIATGQ£?PAQ NVESSHCITM MEAVNDG 
KDLHI 

HMGRclusCalW{camptothec} AHASNIVSAV YLATGQDPAQ NVESSHCITM MEA^^NDG 
. KDLHV 



HMGRclustalw{arabadops2} AHSSNIVSAV FIATGQOTAO N^/SSSHC.MTM ILPDGD. . 
. DLKI 



HMGRcluscalW{chineseham} AKAANIVTAI YIACGQCAAQ NVGSSNCXTL MEASGPTN, 
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EDLYX 

! HMG^lustalW{chineseha2} AHAANIVTAI YIACGQDAAQ NVGSSNCITL MEASGPTN. . 

\ HMGRclustalW{syrianhamst} AHAANIVTAI YIACGQ£>AAQ NVGSSNCITL MEASGPTN.. 

EMGRclustalW{ rat} LHAANIVTAI YIACGQiJAAQ NVGSSNCITL MEASGPTN 
EDLYI - - . 

HMGRclust:alW{ rabbit} AHAANYVTAI YIACGQDAAQ NVGSSNCITL MEASGPTN 
EDLYI 

HMGRclustalW{ human} AHAANIVTAI YIACGQDAAQ NVGSSNCITL MEASGPTN 
EDLYI . . 

HMGRclustalW{ mouse} AEAANIVTAX YIACGQDAAQ NVGSSNCITL MEASGPTN 
EDLYI 

HMGRclustalW{ xenopus} AHAANIVTAI YIACGQDAAQ NVGSSNCITI MEATG^aTY 
EDLYI 

HMGRclustalW{sea urchin} AHAANIVTAI YIATGQDAAQ NIASSNCMTL METRGPKG 
GDLYL ' .... 

HMGRclustalW{ cockroach} AHAANIVTAI FIATGQDPA.Q NVGSSNCMTL MEPWGEDG 
KDLYV • 

HMGRclustalW{drosophila} AKAANMVTAV FLATGQDPAQ NVTSSNCSTA MECWAENS 
EDLYM 

HMGRclustalWfdictyostSl} AEASNIVTAI FLATGQZX::AQ NVESSNCITO MEACNDG 
QDLYI 1^^. .jw... 

HMGRclustalW{schistoscm} AHAANIIAGM FAATGQDLAQ WDSSSCLTQ LEVDLSD 
DSLVA 

HMGRclustalW{archaeoglo} lATGNDFRA lEAGAHSYAA IGG YKPI^tt 

YEVDRKGNLV ' 

HMGRcluscalW{pseudamonas} VATGNDWRA VEAGAHAY"AC RSGF^gsiltt 

WEKDNNGHLV 

Consensus AHAANIVTAI FIATGQDPAQ NVESSNCITM MEAVNDGN-- - 

KDLHI 

D 
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1001 

1050 

HMGRclust:alW{methanobac} AVNLPDVPIA TVGGGTGLET ASECIiDIMCV RGGG 
RVHAFAEIVG 

HMGRclust:alW{methanococ} SVTLPDVPIG TVGGGTRVET QKSCLEMLGC YGDN 
KALKFAEIVG 

HMGRclust:alW(halobacter} SVSIASLEVG TVGGGTKLPT QSEG1J}ILGV SGGGDP AGS 
NADALAECIA 

HMGRclust:alW{sulfolobus} SVTLPSLEn/G TVGGGTRLPT QBCSALSIMGV YGSGN? t>r-s 
NAKKLAEIIA 

HMGRclust:alW{ yeast2} SVSMPSIEVG TIGGGTVLEP QGAiMUDLLGV RGPK^TEPGA 

NARQLARIXA 

HMGRclustalW{ yeastl} SVSMPSIEVG TIGGGT^/LEP QGAMLDLLGV RGPHATAPGT 

NARQLARIVA 

HMGRcluscalW{phycomyces } 



HMGRclustalW{ fusarium} SVSMPSLE^/G TLGGGTILEP CGAMLDILGV RGSH^TN-GD » 
NARRUVRIIG 

EWGRclustalW{ Candida} SVSMPSIEVG TIGGGTIUD? QGSMLELLGV RG.oi\DV^GS 
NARQLAKIVA 

HMGRclust:alW{dict:ycst:e2} SVTMPSIE^/G TVGGGTHLPA QSACLDLLXI RGa^NLSR^sGA 
NSEQLARV^/A 

HMGR.clustalW{ wheat 1} SVTMPPIEV. 

HMGRclusCalW{ rice} SVTMPSIEVG TIGGGTCLAS QAACLNLLGV KGSNEGSFGA 

NAGRIiAXrVA 

HMGRclustalW{ corn} SVTMPSIEVG TVGGGTQLAS QSACIiDLLGV RGAS?JDR«GS 
NARLIiATWA 

HMGRclusCalW{ wheats } SVTMPPIEV 



HMGRclustalW{wheat2} SVTMPPIEV. 



HMGRcluscalw{ soybean} SVTMPSIEVG TVGGGTQLAS QSACI^NLIiGV KGASXSS^GS 

NSRX.I1ATIVA 

HMGRclustalW{rubberrrs3} SVSMPSISLG TVGGGTQLAS QSACLNLLGV KGASKDS^GS 
NSRIiLATIVA 

HMGRclustalW{rosyperiwi} SVTMPSIEVG TVGGGTQLAS QSACLNXiLGV KGASKDSPGA 
NSRIiLATIVA 

HMGRclust:alW{ tomato} SVTMPSIEVG TVGGGTQLAS QSACLNLLGV KGANHEAPGS 

NARliLATWA 

HMGRclustalW{woodtobacc} SVTMPSIEVG TVGGGTQLAS QSACLNLLGV KGANREVPGS 
NARLUVTIVA 

HMGRclustalW{ potato} SVTMPSIEVG TVGGGTQLAS QSACLNLLGV KGAITRDAPGS 

NARiLATIVA 

HMGRclustalW{ radish} SVTMPSIEVG TVGGGTQLAS QSACLNLLGV KGASKESPGM 
NSRRLATIVA 

HMGRclustalW{arabadopsisl} SVTMPSIEVG TVGGGTQLAS QSACLNLLGV KGASTES^^GM 
NARRUVTIVA 

HMGRclustalW{cucumismel} SVTMPSIEVG TVGGGTQLAS QSACLNLLGV KGASKES^GA 
NSRLLATIVA 

HMGRclustalW{rT-Lbbertre2} SVTLPSIEVG TVGGGTQLAS QSACLNLLGV MGACKESPGS 
YSRIiLATIVA 

HMGRclustalW{rubbertrel} SVTMPSIEVG TVGGGTQLAS QSACLNI»LGV KGANKESPGS 
NSRLLAAIVA 

HMGRclustalW{camptothec} SVTMPSIEVG TVGGGTQLAS QSACLNLLGV KGASKSAPGS 
NARLIATIVA 

EMGRclustalW{arabadops2} SVSMPCIBVG TVGGGTQLAS QAACLNLLGV KGSNNEKPGS 
NAQQLARIVA 

HMGRcIustalw{ciiineseham} SCTMPSISIG TVGGGTNLLP QQACLQMLGV QGACKDNPGE 
NARQLARIVC 
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HMGRclusCalW{chineseha2} SCTMPSXEIG TVGGGTNLLP QQACIiQMLGV QGACKDN^'GH 

NARQLARIVC • * 

HMGRclustalW{syrianhamst} SCTMPSIEIG TVGGGTNLLP QQACLCMLG^/ QGACXDJI^sGF 

NARQLARIVC - ^-v^fi. 

HMGRclustalW{ rat} SCTMPSIEIG TVGGGTNLLP QQACLQMLGV QGACKDNPGE 
NARQLARIVC 

HMGRclust:alW{ rabbit) SCTMPSIEIG TVGGGTNLLP QQACLQMLGV CGACKDS^^G- 

NARQLARIVC - - - 

EMGRclustalW{ human} SCTMPSIEIG TVGGGTNLLP QQACLQMLGV QGACXDNPG^ 
NARQLARIVC 

HMGRclust:alW{ mouse} SCTMPSIEIG TVGGGTNLLP QQACLQMLGV QGACXDN=GS 
NARQLARIVC 

HMGRclustalW{ xeaopus} SCTMPSIEIG TVGGGTNLAP QQACLQMJ.GV QGASTSTPGK 
NACQLAQIVC 

HMGRclustalW{sea urchin} SCTMPSIELG T^/GGGTVLPP QSACLQMT'tDV KGS2TT--rGSGL 
NASQLARIVC 

HMGRclustalW{ cockroach} SCTMPSIEIG TIGGGTVLPP QAACLDMLGV RGANEMC^G^ 
NANTLARIVC 

BMGRclustalW{drosophila} TCTMPSLEVG TVGGGTGLPG QSACLEMLGV RGAF^^-t^ogD 
NAiCKLAQIVC 

HMGRclustalW{dictyosCel} TVTMPSISVG TVGGGTSLPA QSACLDIIG^/ YiQSSSSY:2G^ 
NADQLAiCTIA 

EMGRclustalW{schistosom} SVTMPCLEVG T^QQQT^SQ QPJJtCLDLLDL SV D R 

PTSHLSRIIA 



HMGRclustalW{archaecglo} GTISIPMAVG VIGGATI-O/NP LAXISLKILG VNTASSLARV 
MGRclustalW{pseudomonas} GTLSMPMPVG LVGGATI-CTH? LAQLSLRILG VKTAQALASI 

Consensus SVTMPSIEVG T^/GGGTQLAP QSACLNLLGV KGA-KZSPGS 



NARQLARIVA 

NADK binding domain 2 
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1051 

1100 

HMGRclustalw(meChanobac} GAVIAGELSL MGMiAAGKEA RAESELGRG. 



HMGRclustalW{methanococ} AAVIAGHLSI* LGALAAGELG KAHQELGR 

*HMGRclustalW{halobacter} VGSLAGELSL LSALASRHLS SAHA2LGR 

HMGRclustalWfaul^olofa"^} STVLSGELNIj LAALSNKELG KAH^JCLGRAxM KV. 



HMGRclustalW{ yeast2} CAVIAGELSL QSHMTH2raX. . .TNKANELP 

QPS 

HMGRclustalW{ yeastl) CAVLAGEI^SL OAALAAGELV QSHMTKNRK? ASPTXPNNLD 

ATDI 

EMGRclustalw{phyccmyces} 



HMGRclustalW{ fusarium} AAVIAGELSL CSALAAGKLV RAHMQHN"RSA APSRSTTPGS * 
SKDARLTGHD 

HMGRclust:alW{ Candida} SIVLSGELSL VSALAAGELV QSHMQKNRAA AKK 



EMGRclustalW(dict:yost:e2 } AA VLSGELSL MSAIiAAGHLV RSHLKH^TRKT EAPAPQADTI 
SMTHNLPHSD 

HMGRcluscaiW{wheat:l} 



HMGRclustalW{ rice} GSWAGRALL LAALASGHLV KSEMr-T/MRSS KDVAK. 

HMGRclusCalW{ cozm} GGVLAGSLSL LSAIAAGQLV KSHMKViTOSS KDVSS . 

HMGRcluscalW (wheats } 

HMGRc lus C al M { whea C2 } ' 

HMGRclustalW{ soybean} GSVLAGELSL MSAIAAGQIiV NSHMKVNHSS KD^/TK. 

* *HMGRcluscalW{ rubbercre3} GSVLAGELSL MSAIAAGQLV NSHMiC/NRSA KDVSK. 

HMGRclusCalWlrosyperiwi} GSVLAGELSL MSAISAGQLV RSHMKYOTtss KDITN. 
' 'HMGRclustalW{ coma to} GSVLAGELSL MSAISSGQLV NSHMKYNRST KDVTK. 

"HMGRciustalW(woodtobacc} GSVLAGELSL MSAISAGQLV KSHMKYNRST KDVTK. < 
HMGRclustalW{ potato} GSVLAGELSL MSAISAGQLV KSHMKYNRSI KDISK. , 

KMGRclustalW{ radish} GAVLAGELSL MSAIAAGQLV RSHMK'rMRSS RDISG,. 
HMGRcIustalW{arabadopsisl} GAVLAGELSL MSAIAAGQLV RSHMKYNRSS RDISG.. 

HMGRclustalW{cucumismel} GSVLAGELSL MSAIAAGQLV RSHMKYJIRSS RDVSK. . 
* *HMGRclustalW{rubberCre2} GSVLAGELSL MSAIAAGQLV KSEMKYNRSS KDVSK.. 
' *HMGRclustalW{rubberCrel} GSVLAGELSL MSAIAAGQLV KSHMKYNRSS KDMSK. . 
"fiMGRciustalWf camp to thee} GSVLAGELSL MSAIAAGQLV NSHMKYNTISN KDVTK.. 
'HMGRciu5talwfarabadops2} GSVLAGELSL MSAIAAGQLV KSHMKYNRSS RDKSP.. 
HMGRcius calW { chineseham} GTVMAGELSL MAALAAGHLV RSHMVHNRSK INLQD. . 
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HMGRclusCalW{chineseha2} GTVMAGSLSL MAAUVAGKLV RSHMVENXSK INLQD 
HMGRclustalwfsyrianhamst} GTVMAGELSL MAALAAGHLV RSHMVEMMK INLQD 



HMGRclustalW{ rat} GTVMAGELSL MAAIAAGHLV RSHMVENRSK INLQD 



fiMGRclustalW[ 


rabbit} 


GTVMAGELSL 


MAALAAGKLV 


KSBMXHNKSK 


INLQD ; 


HMGRclust:alW{ 


human} 


GTVMAGELSL 


MAALAAGHLV 


KSHMIKNRSK 




HMGRclust:alW{ 


mouse} 


GTVMAGELSL 


MAALAAGHLV 


RSHMVHMSSK 


INLQD 


HMGRclustalWf 


xenotjus } 


STVMAGELSL 


MAALAAGHLV 


KSHMVHSTRSK 


INLQD 


HMGRclust:alw{sea urchin} 
DEVATKRRSK 


. ATVMAGELSL 


MSALAAGHLV 


KSHMKHNSSA 


LNIASPLPSI 


HMGRclusCalW{ 


cockroach} 


GT^/LAGELSL 


MSALAAGHLV 


KSHMRSTRSS 


VSTSG 


HMGRclustalWfd 


rosophilal 


AT'/MAGELSL 


MAALWSDLV 


KSHMHH^rHSS 


lA^/MSAN . . . 


HMGRclustalw{d 


ictyostel} 


SAVMAGELSL 


MSALSAGKLM 




T^T 


HMGRclustalW{ schistosoma 


GTVLAAHLSL 


MAALDTDDLV 


KAHMHFJIRAK 


QST2>lSsiSCSK 


STTTDNNDNI 












KMGRcluscalW{archaeoglo} 
EVDRWEIMV 


. . GLAQNFAA 


LRALATEGIQ 


RGHMELiiARN 


LAIMAGATGD 



HMGRclustalW{pseudomoGas} . .GLAQNILGA MRALATEGIQ RGHMALHARN lAV^/AGARGD 
EVDWVARQLV 



Consensus GTVLAGELSL MSALAAGHLV KSHMK-NRSS KDVSK 
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1101 

1152 

HMGRc 1 us t al W { me t hanobac } 



HMGRclus tal W {me thanococ } 

HMGRclust:alW{halobact:er} 

HMGRclustalW{sulf Globus} 

HMGRclus talw{ yeast:2} NKGPPCKT SAiL 

HMGRcliistalW{ yeastl} NRI*KDGSV TCIKS . . . 

HMGRclus talW{phycornyces} 

HMGRclust:alW{ fusarium} QCPRALSVNN VDERRRYSSV KAXDE... 

HMGRclustalw{ Candida} 

HMGRclus£alW{dictyoste2} 

HMGRclustalwfwheaci} ' 

HMGRclustalW{ rice} a AS 

HMGRclustalW{ com} t TATSK TRQRSVDV. 

HMGRclus talw{ wheats} 

HMGRclus talW { wheat2 } 

HMGRclustalW{ soybean} t g 

HMGRclus talW{rubbertre3} t XF 

HMGRclust:alW{rosyperiwi} i ASSQL ESDS 

HMGRciustalW{ tomato} a SS 

HMGRclus talW(woodtobacc} a SS 

HMGRclustalW{ potato} 

HMGRclus talWf radish} a TTTT 

HMGRclus talW{arabadopsisl} a TTTTT TTT 

HMGRclustalW{cucumismel} L ES 

HMGRclus talw { rubbertre2 } a AS 

HMGRclustalw{rubbertrel} a AS 

HMGRclus talw {camp to thee} a SS 

HMGRclus calw { arabadops2 } s SQVNR 

HMGRclus talw {chineseham} LQGTCTK KSA 
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HMGRcl US talW{ chines eiia2} LQGTCTK KSA. 

HMGRclust:alW{syxianhamst} LQGTCTK KAA. 

HMGRclustalW{ rac} LQGTCTK KAA. 

HMGRclustalW{ rabbit} LEGACTK KAA. 

HMGRclustalW{ human} LCGACTK ,KTA. 

HMGRclust:alW{ mouse} LQGTCTK KAA. 

HMGRclust:alW{ xenopus} LPGTCTX KAA. 

KMGRclust:alW{sea urchin} SVDFSALKES SAAAPGTCTA MAS. 

HMGRclust:alW{ cockroach} S . . .EPSTPAC KS . . . 

HMGRclustalW{drosophila} N? LNVTVSSCST IS.., 

HMGRclust:alW{dict:yoscel} 



HMGRclustalW{schist:osom} SNIYDNHWA LSSKIP^/TDN SDIR2SVKSL HVKP^'tsvxSD 
LSVNPEISKY TM 

HMGRclustalw{archaeoglo} RBGKXRLDYA KSVLERLRS 



HMGRcluscalW{pseudomonas} EVHDVRADHA VALLKQIGiGQ .... 

Consensus »>i a LQGTCTK KAA- 
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SEQUENCE LISTING 

<110> Monsanto Company 

<12 0> TRANSGENIC PLANTS CONTAINING ALTERED LEVELS OF STEROID COMPOtJNDS 
<13 0>- MTC6783.3 
<160> 33 

<170> Patentin version 3.0 

<210> 1 
<211> 585 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 1 

Met Lys Pro Phe Val lie Arg Asn Leu Pro Arg Phe Gin Ser Thr Leu 
15 10 15 

Arg Ser Ser Leu Leu Tyr Thr Asn His Arg Pro Ser Ser Arg Phe Ser 
20 25 30 

Leu Ser Thr Arg Arg Phe Thr Thr Gly Ala Thr Tyr He Arg Arg Trp 
35 40 45 

Lys Ala Thr Ala Ala Gin Thr Leu Lys Leu Ser Ala Val Asn Ser Thr 
50 55 60 

Val Met Met Lys Pro Ala Lys He Ala Leu Asp Gin Phe He Ala Ser 
65 70 75 80 

Leu Phe Thr Phe Leu Leu Leu Tyr He Leu Arg Arg Ser Ser Asn Lys 
85 90 95 

Asn Lys Lys Asn Arg Gly Leu Val Val Ser Gin Asn Asp Thr Val Ser 
100 105 110 

Lys Asn Leu Glu Thr Glu Val Asp Ser Gly Thr Asp Val He He Val 
115 120 125 

Gly Ala Gly Val Ala Gly Ser Ala Leu Ala His Thr Leu Gly Lys Glu 
130 135 140 

Gly Arg Arg Val His Val He Glu Arg Asp Phe Ser Glu Gin Asp Arg 
145 150 155 160 

He Val Gly Glu Leu Leu Gin Pro Gly Gly Tyr Leu Lys Leu He Glu 
165 170 175 

Leu Gly Leu Glu Asp Cys Val Lys Lys He Asp Ala Gin Arg Val Leu 
180 185 190 

Gly Tyr Val Leu Phe Lys Asp Gly Lys His Thr Lys Leu Ala Tyr Pro 
195 200 205 
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Leu Glu Thr Phe Asp Ser Asp Val Ala Gly Arg Ser Phe His Asn Gly 
210 215 220 • 

Arg Phe Val Gin Arg Met Arg Glu Lys Ala Leu Thr Leu Ser Asn Val 
225 230. 235 240 

Arg Leu Glu Gin Gly Thr Val Thr Ser Leu Leu Glu Glu His Gly Thr 
245 250 255 

lie Lys Gly Val Arg Tyr Arg Thr Lys Glu Gly Asn Glu Phe Arg Ser 
S 260 265 270 

Phe\Ala Pro Leu Thr lie Val Cys Asp Gly Cys Phe Ser Asn Leu Arg 
275 280 285 

Arg Ser Leu Cys Lys Pro Lys Val Asp Val Pro Ser Thr Phe Val Gly 
290 295 300 

Leu Val Leu Glu Asn Cys Glu Leu Pro Phe Ala Asn His Gly His Val 
305 310 315 320 

Val Leu Gly Asp Pro Ser Pro He Leu Met Tyr Pro He Ser Ser Ser 
325 330 335 

Glu Val Arg Cys Leu Val Asp Val Pro Gly Gin Lys Leu Pro Pro He 
340 345 350 

Ala Asn Gly Glu Met Ala Lys Tyr Leu Lys Thr Arg Val Ala Pro Gin 
355 360 365 

Val Pro Thr Lys Val Arg Glu Ala Phe lie Thr Ala Val Glu Lys Gly 
370 375 380 

Asn lie Arg Thr Met Pro Asn Arg Ser Met Pro Ala Asp Pro He Pro 
385 390 395 400 

Thr Pro Gly Ala Leu Leu Leu Gly Asp Ala Phe Asn Met Arg His Pro 
405 410 415 

Leu Thr Gly Gly Gly Met Thr Val Ala Leu Ala Asp He Val Val Leu 
420 425 430 

Arg Asp Leu Leu Arg Pro He Arg Asn Leu Asn Asp Lys Glu Ala Leu 
435 440 445 

Ser Lys Tyr He Glu Ser Phe Tyr Thr Leu Arg Lys Pro Val Ala Ser 
450 455 460 

Thr He Asn Thr Leu Ala Asp Ala Leu Tyr Lys Val Phe Leu Ala Ser 
465 470 475 480 

Ser Asp Glu Ala Arg Thr Glu Met Arg Glu Ala Cys Phe Asp Tyr Leu 
485 490 495 

Ser Leu Gly Gly Val Phe Ser Ser Gly Pro Val Ala Leu Leu Ser Gly 
500 505 510 
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Leu Asn Pro Arg Pro Leu Ser Deu Val Leu His Phe Phe Ala Val Ala 
515 520 525 

lie Tyr Ala Val Cys Arg Leu Met Leu Pro Phe Pro Ser He Glu Ser 
530 535 540 

Phe Trp Leu Gly Ala Arg He He Ser Ser Ala Ser Ser He He Phe 
545 550 555 560 

Pro He He Lys Ala Glu Gly Val Arg Gin Met Phe Phe Pro Arg Thr 
565 570 575 

He Pro Ala He Tyr Arg Ala Pro Pro 
580 585 



<210> 


2 


<211> 


418 


<212> 


DNA 


<213> 


Ar abi dops i s t hal i ana 


<220> 




<221> 


Unsure 


<222> 


(1) . . (418) 


<223> 


n=a, c, g or .t 


<400> 


2 



cttacgcgtg 


gttatngacg 


cttctcgcct 


ttgttctgac 


atggatgatt 


tttcacctca 


60 


tcaagatgaa 


gaaggcggca 


accggagatt 


tagaggccga 


ggcagaagca 


agaagagatg 


120 


gtgcaacgga 


tgtcatcatt 


gtnggggcgg gtgttgcagg 


cgcttctctt 


gcttatgcnt 


180 


tagctaagga 


tngacgacga 


gtacatgtga 


tagagangga 


cttaaaagag 


ccacaaagat 


240 


tcatgggaga 


nctgatgcaa 


ncgggaggtc 


gctttcatgt 


taagcccagc 


ttggcctcga 


300 


agattgttnt 


ggaggacatn 


gacgcacaag 


aatncgaaan 


cctttggcat 


atnccaagnn 


360 


tggaaacacg 


cgaaatggcc 


tttccanatg 


aaaagaantt 


tcctcatgag 


ccagtagg 


418 



<210> 


3 


<211> 


354 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


Unsure 


<222> 


(1) . . (354) 


<223> 


n=a, c, g or t 


<400> 


3 



gcaatgactt acgcgtggtt atggacgctt ctngcctttn tnctgacatg gatggttttt 60 
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cacctcanca agatgaagaa ggcggcaacc ggagatttag aggccgaggc agaagcaaga 12 0 

agagatggtg caacggatgt natcattgtt ggggcgggtn ttgcaggcgc ttctnttgct 180 

tatnctttag ctaaggatgg acgacgagta catgtgatag agagggactt aaaagagcca 24 0 

caaagattca tgggaganct gatgcaagcg gggaggtcgc ttcatgttag cccagnttgg 300 

cctcgaagat tttttttgna gggcataaga cgnaccaana agcggaatnc cttt 354 

<2ai0> 4 
<2il> 182 9 
<21?> DNA 

<213> Arabidopsis tha liana 
<400> 4 

gaattcccgg gtcgacccac gcgtccgctt atagataagg atatggcctt tacgaacgtt 6 0 

tgcctatgga cgctactcgc cttcatgctg acttggacag tgttctacgt cacaaacagg 12 0 

gggaagaagg cgacgcagtt ggcggatgcg gtggttgaag agcgagaaga cggtgctact 180 

gacgttatca tcgttggggc tggagtaggc ggctcggctc tcgcatatgc tcttgctaag 240 

gacgggcgtc gagtccatgt aatagagagg gacctgagag aaccagagag aatcatgggt 300 

gagtttatgc aaccaggagg acgactcatg ctctctaagc ttggtcttga agattgtttg 3 60 

gagggaatag atgcccaaaa agccacgggc atgacagttt ataaggacgg aaaagaagca 420 

gtcgcatctt ttcccgtgga caacaacaat tttccttttg atccttcggc tcgatctttt 480 

cacaatggcc gattcgtcca acgattgcgg caaaaggctt cttctcttcc caatgtgcgc 540 

ctggaagaag gaacggtgaa gtctttgata gaagaaaaag gagtgatcaa aggagtgaca 600 

tacaaaaata gcgcaggcga agaaacaaca gccttggcac ctctcactgt agtatgcgac 660 

ggttgctact caaaccttcg ccggtctctt aatgacaaca atgcggaggt tctgtcatac 72 0 

caagttggtt ttatctcaaa gaactgtcag cttgaagaac ccgaaaagtt aaagttgata 780 

atgtctaaac cctccttcac catgttgtat caaatcagca gcaccgacgt tcgttgtgtt 84 0 

tttgaagttc tccccaacaa cattccttct atttcaaatg gtgaaatggc tactttcgtg 900 

aagaacacta ttgctcctca ggtaccttta aaactccgca aaatattttt gaaagggatt 960 

gatgaaggag aacatataaa agccatgcca acaaagaaga tgacagctac tttgagcgag 102 0 

aagaaaggag tgattttatt gggagatgca ttcaacatgc gtcatccagc aatcgcatct 1080 

ggaatgatgg ttttattatc tgacattctc attttacgcc gtcttctcca gccattaagc 1140 

aaccttggca atgcgcaaaa aatctcacaa gttatcaagt ccttttatga tatccgcaag 1200 
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ccaatgtcag cgacagttaa cacgttagga aatgcattct ctcaagtgct agttgcatcg 1260 

acggacgaag caaaagaggc aatgagacaa ggttgctatg attacctctc tagtggtggg 1320 

tttcgcacgt cagggatgat ggctttgcta ggcggcatga accctcgtcc gatctctctc 13 80 

atctatcatc tatgtgctat cactctatcc tccattggcc atctactctc tccatttccc 1440 

tctccccttg gcatttggca tagccttcga ctttttggtt tggctatgaa aatgttggtt 1500 

ccccatctca aggctgaagg agttagccaa atgttgtttc cagtcaacgc cgccgcgtat 1560 

agcaaaagct atatggctgc aacggctctt taaaacactg gtgctttaaa ctgcaaaata 1620 

taacacatat ataaatcccg aatctttgtg attctgcata tattgtgttc tacaattatt 168 0 

ctcatataaa tgaaaattgt tctacgtaaa agtaaaaaga aggaattgta atactaataa 1740 

aacgagtttt taattctgtt gaatgcttgt gtatattggt gaaaaaaaaa aaaaaaaaaa 1800 

aaaaaaaaaa aaaaaaaaag ggcggccgc 182 9 

<210> 5 
<211> 530 
<212> PRT 

<213> Arabidopsis thaliana 

<400> 5 

Glu Phe Pro Gly Arg Pro Thr Arg Pro Leu lie Asp Lys Asp Met Ala 
3-5 10 15 

Phe Thr Asn Val Cys Leu Trp Thr Leu Leu Ala Phe Met Leu Thr Trp 
20 25 30 

Thr Val Phe Tyr Val Thr Asn Arg Gly Lys Lys Ala Thr Gin Leu Ala 
35 40 45 

Asp Ala Val Val Glu Glu Arg Glu Asp Gly Ala Thr Asp Val He He 
50 55 60 

Val Gly Ala Gly Val Gly Gly Ser Ala Leu Ala Tyr Ala Leu Ala Lys 
^5 70 75 80 

Asp Gly Arg Arg Val His Val He Glu Arg Asp Leu Arg Glu Pro Glu 
85 90 95 

Arg He Met Gly Glu Phe Met Gin Pro Gly Gly Arg Leu Met Leu Ser 
100 105 110 

Lys Leu Gly Leu Glu Asp Cys Leu Glu Gly He Asp Ala Gin Lys Ala 
115 120 125 

Thr Gly Met Thr Val Tyr Lys Asp Gly Lys Glu Ala Val Ala Ser Phe 
130 135 140 
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Pro Val Asp Asn Asn Asn Phe Pro Phe Asp Pro Ser Ala Arg Ser Phe 
3-45 150 155 160 

His' Asn Gly Arg Phe Val Gin Arg Leu Arg Gin Lys Ala Ser Ser Leu 
165 170 175 

Pro Asn Val Arg Leu Glu Glu Gly Thr Val Lys Ser Leu lie Glu Glu 
180 185 190 

Lys Gly Val He Lys Gly Val Thr Tyr Lys Asn Ser Ala Gly Glu Glu 
, 200 205 

Thr Thr Ala Leu Ala Pro Leu Thr Val Val Cys Asp Gly Cys Tyr Ser 
\ 210 215 220 

Asn Leu Arg Arg Ser Leu Asn Asp Asn Asn Ala Glu Val Leu Ser Tyr 
225 230 235 240 

Gin Val Gly Phe He Ser Lys Asn Cys Gin Leu Glu Glu Pro Glu Lys 
245 250 255 

Leu Lys Leu He Met Ser Lys Pro Ser Phe Thr Met Leu Tyr Gin He 
260 265 270 

Ser Ser Thr Asp Val Arg Cys Val Phe Glu Val Leu Pro Asn Asn He 
275 280 285 

Pro Ser He Ser Asn Gly Glu Met Ala Thr Phe Val Lys Asn Thr He 
290 295 300 

Ala Pro Gin Val Pro Leu Lys Leu Arg Lys He Phe Leu Lys Gly He 
305 310 315 320 

Asp Glu Gly Glu His He Lys Ala Met Pro Thr Lys Lys Met Thr Ala 

325 330 335 

thr Leu Ser Glu Lys Lys Gly Val He Leu Leu Gly Asp Ala Phe Asn 
340 345 350 

Met Arg His Pro Ala He Ala Ser Gly Met Met Val Leu Leu Ser Asp 
355 360 365 

He Leu He Leu Arg Arg Leu Leu Gin Pro Leu Ser Asn Leu Gly Asn 
370 375 380 

Ala Gin Lys He Ser Gin Val He Lys Ser Phe Tyr Asp He Arg Lys 
385 390 395 400 

Pro Met Ser Ala Thr Val Asn Thr Leu Gly Asn Ala Phe Ser Gin Val 
405 410 415 

Leu Val Ala Ser Thr Asp Glu Ala Lys Glu Ala Met Arg Gin Gly Cys 
420 425 430 

Tyr Asp Tyr Leu Ser Ser Gly Gly Phe Arg Thr Ser Gly Met Met Ala 
435 440 445 
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Leu Leu Gly Gly Met Asn Pro Arg Pro lie Ser Leu He Tyr His Leu 
450 455 460 

Cys Ala He Thr Leu Ser Ser He Gly His Leu Leu Ser Pro Phe Pro 
465 470 475 480 

Ser Pro Leu Gly He Trp His Ser Leu Arg Leu Phe Gly Leu Ala Met - 
485 490 495 

Lys Met Leu Val Pro His Leu Lys Ala Glu Gly Val Ser Gin Met Leu 
500 505 510 

Phe Pro Val Asn Ala Ala Ala Tyr Ser Lys Ser Tyr Met Ala Ala Thr 
515 520 525 

Ala Leu 
530 

<210> 6 

<211> 2038 

<212> DNA 

'<213> Arabidopsis thaliana 

<400> 6 



gaattcccgg 


gtcgacccac 


gcgtccgcgg 


acgcgtggga 


ttgagaacaa 


atagatttgg 


60 


ttatatatgg 


cttttacgca 


cgtttgttta 


tggacgttag 


tcgccttcgt 


gctgacgtgg 


12 0 


acggtgttct 


accttaccaa 


catgaagaag 


aaggcgacgg 


atttggctga 


tacggtggct 


180 


gaggatcaaa 


aagacggtgc 


tgctgacgtc 


attatcgtcg 


gggctggtgt 


aggtggttcg 


240 


gctctcgcat 


atgctcttgc 


taaggatggg 


cgtcgagtac 


atgtgatcga 


gagggacatg 


300 


agagaaccag 


aaagaatgat 


gggtgagttt 


atgcaacctg 


gcggacgact 


catgcttt-ct 


360 


aaacttggcc 


ttcaagattg 


cttggaagac 


atagatgcac 


agaaagccac 


gggtttggca 


420 


gtttataaag 


atggaaaaga 


agcagacgca 


ccttttccag 


tggataacaa 


caatttttct 


480 


tatgaacctt 


ctgctcgatc 


ttttcacaat 


ggccgattcg 


tccaacaact 


gcgtcgaaag 


540 


gctttttctc 


tttccaatgt 


gcgcctggaa 


gaaggaacgg 


tgaagtcttt 


actagaagaa 


600 


aaaggagtgg 


tcaaaggagt 


gacatacaag 


aataaagaag 


gcgaagaaac 


aacagccttg 


660 


gcacctctca 


ctgtggtatg 


cgacggttgc 


tactcaaacc 


ttcgtcggtc 


tcttaatgat 


720 


gacaacaatg 


ctgagattat 


gtcgtacata 


gttggttaca 


tctcaaagaa 


ttgtcggctt 


780 


gaagaacccg 


aaaagctaca 


cttgatattg 


tctaaaccat 


ctttcaccat 


ggtataccaa 


840 


ataagcagca 


ctgacgttcg 


ttgtggtttt 


gaggttctcc 


ccgaaaattt 


tccttctatt 


900 


gcaaatggtg 


aaatgtctac 


tttcatgaag 


aatactatag 


ttcctcaggt 


acctccaaaa 


960 


ctccgcaaaa 


tatttttgaa 


aggtatagat 


gagggagcac 


acataaaagt 


ggtgccggca 


1020 
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aagcgcatga catctacttt aagcaagaag aaaggtgtga ttgtattggg agatgcattc 10 8 0 

aatatgcgtc atccagttgt tgcatctgga atgatggttt tactgtcgga cattctcatt 114 0 

ctacgccgtc ttcttcagcc attaagcaac ctcggcgatg caaacaaagt ctcagaagtt 12 00 

atcaattcct tttatgatat ccgcaagcca atgtcggcga cggttaacac attgggaaat 12 60 

gcattttctc aagtactaat tggatcaacg gatgaagcaa aagaggcaat gagacagggt 13 2 0 

gtctatgatt acctttgtag tggcgggttt cgtacgtcag ggatgatggc tctgctcggc 13 80 

ggc'ptgaatc ctcgtcctct ctctctcgtc tatcatcttt gtgccatcac tctatcctcc 1440 

attggccaac tgctctctcc atttccctct ccccttcgca tttggcatag cctcaagctt 1500 

tttggtttgg ccatgaaaat gttggttccc aatctcaaag ctgaaggagt tagccaaatg 15 60 

ttgtttccag caaatgcagc cgcgtatcac aaaagctata tggctgcaac cactctctaa 1620 

actttgatgc tctcaatcgc aatatatatg gagcacgaat ctatgtgatt gtgcatttgg 1680 

taaacgtgta ttgcagtgct tataattatt agtatgtaac ggggaaaagt tctaaacaca 174 0 

aaaaaataaa cttttgaatg ttatatgtgt gaattatttt tgttgttaca agtaatgctc 18 00 

ttttttttta gcttcacaca tgtattattg gagctaattt tttgtttctc tgttctttta 1860 

tttttgtttt cttactgtat ttactttgaa aagtttcgtt ttatacatat tggacatttt 192 0 

ttaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 19 8 0 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg gcggccgc 2 038 

<210> 7 
<211> 517 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 7 

Met Ala Phe Thr His Val Cys Leu Trp Thr Leu Val Ala Phe Val Leu 
15 10 15 

Thr Trp Thr Val Phe Tyr Leu Thr Asn Met Lys Lys Lys Ala Thr Asp 
20 25 30 

Leu Ala Asp Thr Val Ala Glu Asp Gin Lys Asp Gly Ala Ala Asp Val 
35 40 45 

He He Val Gly Ala Gly Val Gly Gly Ser Ala Leu Ala Tyr Ala Leu 
50 55 60 

Ala Lys Asp Gly Arg Arg Val His Val He Glu Arg Asp Met Arg Glu 
65 70 75 80 
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Pro Glu Arg Met Met Gly Glu Phe Met Gin Pro Gly Gly Arg Leu Met 
85 90 95 

Leu Ser Lys Leu Gly Leu Gin Asp Cys Leu Glu Asp He Asp Ala Gin 
100 105 110 

Lys Ala Thr Gly Leu Ala Val Tyr Lys Asp Gly Lys Glu Ala Asp Ala 
115 120 125 

Pro Phe Pro Val Asp Asn Asn Asn Phe Ser Tyr Glu Pro Ser Ala Arg 
130 135 140 

Ser Phe His Asn Gly Arg Phe Val Gin Gin Leu Arg Arg Lys Ala Phe 
145 150 155 160 

Ser Leu Ser Asn Val Arg Leu Glu Glu Gly Thr Val Lys Ser Leu Leu 
165 170 175 

Glu Glu Lys Gly Val Val Lys Gly Val Thr Tyr Lys Asn Lys Glu Gly 
180 185 190 

Glu Glu Thr Thr Ala Leu Ala Pro Leu Thr Val Val Cys Asp Gly Cys 
195 200 205 

Tyr Ser Asn Leu Arg Arg Ser Leu Asn Asp Asp Asn Asn Ala Glu He 
210 215 220 

Met Ser Tyr He Val Gly Tyr He Ser Lys Asn Cys Arg Leu Glu Glu 
225 230 235 240 

Pro Glu Lys Leu His Leu He Leu Ser Lys Pro Ser Phe Thr Met Val 
245 250 255 

Tyr Gin He Ser Ser Thr Asp Val Arg Cys Gly Phe Glu Val Leu Pro 
260 265 270 

Glu Asn Phe Pro Ser He Ala Asn Gly Glu Met Ser Thr Phe Met Lys 
275 280 285 

Asn Thr He Val Pro Gin Val Pro Pro Lys Leu Arg Lys He Phe Leu 
290 295 300 

Lys Gly He Asp Glu Gly Ala His He Lys Val Val Pro Ala Lys Arg 
305 310 315 320 

Met Thr Ser Thr Leu Ser Lys Lys Lys Gly Val He Val Leu Gly Asp 
325 330 335 

Ala Phe Asn Met Arg His Pro Val Val Ala Ser Gly Met Met Val Leu 
340 345 350 

Leu Ser Asp He Leu He Leu Arg Arg Leu Leu Gin Pro Leu Ser Asn 
355 360 365 

Leu Gly Asp Ala Asn Lys Val Ser Glu Val He Asn Ser Phe Tyr Asp 
370 375 380 
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lie Arg Lys Pro Met Ser Ala Thr Val Asn Thr Leu Gly Asn Ala Phe 
385 390 395 400 

Ser Gin Val Leu lie Gly Ser Thr Asp Glu Ala Lys Glu Ala Met Arg 
405 ^ 410 415 

Gin Gly Val Tyr Asp Tyr Leu Cys Ser Gly Gly Phe Arg Thr Ser Gly 
420 425 430 

Met Met Ala Leu Leu Gly Gly Met Asn Pro Arg Pro Leu Ser Leu VaL 

I 435 440 445 

1 

\ 

Tyr^ His Leu Cys Ala lie Thr Leu Ser Ser lie Gly Gin Leu Leu Ser 
450 455 460 

Pro Phe Pro Ser Pro Leu Arg He Trp His Ser Leu Lys Leu Phe Gly 
465 470 475 480 

Leu Ala Met Lys Met Leu Val Pro Asn Leu Lys Ala Glu Gly Val Ser 
485 490 495 

Gin Met Leu Phe Pro Ala Asn Ala Ala Ala Tyr His Lys Ser Tyr Met 
500 505 510 

Ala Ala Thr Thr Leu 
515 

<210> 8 
<211> 392 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 8 



aaatcatatt 


gagaacaaat 


agatttggtt 


atatatggct 


tttacgcacg 


tttgtttatg 


60 


gacgttagtc 


gccttcgtgc 


tgacgtggac 


ggtgttctac 


cttaccaaca 


tgaagaagaa 


120 


ggcgacggat 


ttggctgata 


cggtggctga 


ggatcaaaaa 


gacggtgctg 


ctgacgtcat 


180 


tatcgtcggg 


gctggtgtag 


gtggttcggc 


tctcgcatat 


gctctgctaa 


gtgtgcgcct 


240 


ggaagaagga 


acggtgaagt 


ctttactaga 


agaaaaagga 


gtggtcaaag 


gagtgacata 


300 


caagaataaa 


gaatgcgaac 


aaacaacagc 


cttggcacct 


ctcactgtgg 


tatgcgacgg 


360 


ttgctaatca 


aaccttcgtc 


ggtctcttaa 


tg 






392 



<210> 9 
<211> 110 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 9 

Met Ala Phe Thr His Val Cys Leu Trp Thr Leu Val Ala Phe Val Leu 
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Thr Trp Thr Val Phe Tyr Leu Thr Asn 
20 25 

_ Leu Ala Asp Thr Val Ala Glu Asp Gin 
35 40 

He He Val Gly Ala Gly Val Gly Gly 
50 55 

Leu Ser Val Arg Leu Glu Glu Gly Thr 
65 70 

Lys Gly Val Val Lys Gly Val Thr Tyr 
85 

Thr Thr Ala Leu Ala Pro Leu Thr Val 
100 105 



<210> 


10 


<211> 


457 


<212> 


DNA 


<213> 


Arab i dop sis t ha 1 i ana 


<220> 




<221> 


Unsure 


<222> 


(1) . . (457) 


<223> 


n=a, c, g or t 


<400> 


10 



cacaaagcaa aaaaatctct gtaaaagcag aacgataatg gagtcacaat tatggaattg 60 

gatcttacct cttttgatct cttctctcct catctccttc gtcgctttct atggattctt 120 

cgtcaaaccg aagcggaacg gtctccgtca cgatcggaaa actgtttcta ccgtcacctc 18 0 

cgacgtcgga tctgttaata ttaccggaga tactgtcgct gatgtcattg ttgttggagc 240 

fcggtgttgct ggttctgctc ttgcttatac tcttggaaag gggaaattta aacgccgagt 300 

tcatgtgatt gaaagagatt tatcggagcc tgatcgtatt gttggggagt tgttacagcc 360 

tgngggttac ctcaagttac tggagtgtgg aattggagat tgtgtggaag aaatagatgc 420 

tcagcntgtg tatggttatg cactttttaa aaatggg 457 



<210> 11 

<211> 152 

<212> PRT 

<213> Arabidopsis thaliana 
<220> 

<221> UNSURE 

<222> (1) . . (152) 



10 15 

Met Lys Lys Lys Ala Thr Asp 
30 

Lys Asp Gly Ala Ala Asp Val 
45 

Ser Ala Leu Ala Tyr Ala Leu 
60 

Val Lys Ser Leu Leu Glu Glu 
75 80 

Lys Asn Lys Glu Cys Glu Gin 
90 95 

Val Cys Asp Gly Cys 
110 
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<223> X=any amino acid 
<400> 11 

Thr Lys Gin. Lys Asn Leu Cys Lys Ser Arg Thr He Met Glu Ser Gin 
1-5 10 15 

Leu Trp Asn Trp He Leu Pro Leu Leu He Ser Ser Leu Leu He Ser 
20 25 30 

Ph^ Val Ala Phe Tyr Gly Phe Phe Val Lys Pro Lys Arg Asn Gly Leu 
\ 35 40 45 

\ 

Arg His Asp Arg Lys Thr Val Ser Thr Val Thr Ser Asp Val Gly Ser 
50 55 60 

Val Asn He Thr Gly Asp Thr Val Ala Asp Val He Val Val Gly Ala 
65 70 75 80 

Gly Val Ala Gly Ser Ala Leu Ala Tyr Thr Leu Gly Lys Gly Lys Phe 
85 90 95 

Lys Arg Arg Val His Val He Glu Arg Asp Leu Ser Glu Pro Asp Arg 
100 105 110 

He Val Gly Glu Leu Leu Gin Pro Xaa Gly Tyr Leu Lys Leu Leii Glu 
115 120 125 

Cys Gly He Gly Asp Cys Val Glu Glu He Asp Ala Gin Xaa Val Tyr 
130 135 140 

Gly Tyr Ala Leu Phe Lys Asn Gly 
145 150 

<210> 12 

<;211> 1486 

<212> DNA 

<213> Arabidopsis thaliana 

<400> 12 



cgtgttttac 


aaatttcctt 


tgttggtttt 


ccacagattt 


aaagaaccct: 


aacgagagaa 


60 


aaaaatggac 


tgggattact 


atacgctgtt 


gaagacgagt 


gtggctatta 


ttatagtgtt 


120 


tgttgtggcc 


aaactcataa 


cctcctccaa 


atccaagaag 


aaaacaagtg 


tcgtcccact 


180 


ccctccagtt 


cttcaagcgt 


ggcctccatt 


tatcggatcc 


ctaatccgct 


tcatgaaagg 


240 


tccaatagtg 


ctacttagag 


aggaatatcc 


taagcttgga 


agtgttttca 


cagtgaagct 


300 


tcttcacaaa 


aacatcactt 


ttctcatcgg 


tcccgaagtc 


tcgtcccact 


ttttcaacgc 


360 


ttatgaatct 


gaactcagcc 


agaaagaaat 


ttacaaattt 


aatgtgccta 


cttttggccc 


420 


cggagttgtg 


tttgatgttg 


actatcccgt 


tcggatggag 


cagttccgat 


tcttctccag 


480 
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cgctctcaag gat tact tct caaaatgggg agaaagtggg gaagtggatc taaaggccga 540 

gttagagcgt ctaatcacct tgactgctag tagatgtcta ttgggtcgag aagtccgtga 600 

ccaacttttt gatgatgttg ctccattgtt ccatgacctt gataaaggca tgcaacccat 660 

aagtgtcatc ttcccaaagc tccccattcc agctcacaat tgtcgtgacc gtgctcgcgg 720 

aaagattgca aaaatctttt caaacatcat agcaacaaga aaacgctctg gtgacaaatc 780 

agagaacgac atgctacaat gtttcatcga ctcaaagtac aaagacggta gagagacaac 84 0 

tgaatctgaa gtaactggtt tgctcattgc tggtttgttt gcaggacaac atacaagctc 900 

tatcactgcc acatggaccg gtgcttatct aattcaaaac aaacactggt ggtccgcggc 960 

tttggacgag cagaagaaac tgattggaaa acatggggac aagatcgact acgatgtttt 102 0 

gtctgagatg gattttctgt ttcgcagtgc aaaagaagct ttaaggcttc accctccaaa 1080 

gatcttactg ctgagaacag tacacagtga tttcaccgtg acaactcgag aaggaaagca 1140 

atatgagata ccaaagggtc atatcgttgc aacttctcct gcattcgcca accgcttacc 1200 

tcatgtctac aaagatccgg aaaattttga tccggataga ttttcaaagg aaagagaaga 1260 

ggataaagca gctggttcgt gttcatacat ctctttggga gctggtaggc acgagtgtcc 1320 

tggtggatca tttgcgttct tgcagatcaa agccgtatgg tgtcacttat tgagaaactt 13 80 

tgagcttgag ttagtgtcac ctttccctga aatcaactgg aatgctttgg tcgttggtgc 1440 

taaaggaaat gtcatggttc gttacaagcg tcgtcccttt tcttaa 1486 

<210> 13 
<211> 473 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 13 

Met Asp Trp Asp Tyr Tyr Thr Leu Leu Lys Thr Ser Val Ala lie lie 
15 10 15 

He Val Phe Val Val Ala Lys Leu He Thr Ser Ser Lys Ser Lys Lys 
20 25 30 

Lys Thr Ser Val Val Pro Leu Pro Pro Val Leu Gin Ala Trp Pro Pro 
35 40 45 

Phe He Gly Ser Leu He Arg Phe Met Lys Gly Pro He Val Leu Leu 
50 55 60 

Arg Glu Glu Tyr Pro Lys Leu Gly Ser Val Phe Thr Val Lys Leu Leu 
65 70 75 80 
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His Lys Asn lie Thr Phe Leu lie Gly Pro Glu Val Ser Ser His Phe 
85 90 95 

Phe Asn Ala Tyr Glu Ser Glu Leu Ser Gin Lys Glu lie Tyr Lys Phe 
100 105 110 

Asn Val Pro Thr Phe Gly Pro Gly Val Val Phe Asp Val Asp Tyr Pro 
115 120 125 

Val Arg Met Glu Gin Phe Ar-g Phe Phe Ser Ser Ala Leu Lys Asp Tyr 

130 135 140 

\ 

\ Phe Ser Lys Trp Gly Glu Ser Gly Glu Val Asp Leu Lys Ala Glu Leu 

145 150 155 160 

Glu Arg Leu lie Thr Leu Thr Ala Ser Arg Cys Leu Leu Gly Arg Glu 
165 170 175 

Val Arg Asp Gin Leu Phe Asp Asp Val Ala Pro Leu Phe His Asp Leu 
180 185 190 

Asp Lys Gly Met Gin Pro lie Ser Val lie Phe Pro Lys Leu Pro . lie 
195 200 205 

Pro Ala His Asn Cys Arg Asp Arg Ala Arg Gly Lys lie Ala Lys lie 
210 215 . 220 

Phe Ser Asn lie lie Ala Thr Arg Lys Arg Ser Gly Asp Lys Ser Glu 
225 230 235 240. 

Asn Asp Met Leu Gin Cys Phe lie Asp Ser Lys Tyr Lys Asp Gly Arg 
245 250 255 

Glu Thr Thr Glu Ser Glu Val Thr Gly Leu Leu lie Ala Gly Leu Phe 
260 265 270 

Ala Gly Gin His Thr Ser Ser lie Thr Ala Thr Trp Thr Gly Ala Tyr 
275 280 285 

Leu lie Gin Asn Lys His Trp Trp Ser Ala Ala Leu Asp Glu Gin Lys 
290 295 300 

Lys Leu lie Gly Lys His Gly Asp Lys lie Asp Tyr Asp Val Leu Ser 
305 310 315 320 

Glu Met Asp Phe Leu Phe Arg Ser Ala Lys Glu Ala Leu Arg Leu His 

325 330 335 

Pro Pro Lys lie Leu Leu Leu Arg Thr Val His Ser Asp Phe Thr Val 
340 345 350 

Thr Thr Arg Glu Gly Lys Gin Tyr Glu lie Pro Lys Gly His lie Val 
355 360 365 

Ala Thr Ser Pro Ala Phe Ala Asn Arg Leu Pro His Val Tyr Lys Asp 
370 375 3B0 
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Pro Glu Asn Phe Asp Pro Asp 
385 390 



Arg Phe Ser Lys Glu Arg Glu Glu Asp 
395 400 



Lys Ala Ala Gly Ser Cys Ser 
405 



Tyr lie Ser Leu Gly Ala Gly Arg His 
410 415 



Glu Cys Pro Gly Gly Ser Phe 
420 



Ala Phe Leu Gin lie Lys Ala Val Trp 
425 430 



Cys His Leu Leu Arg Asn Phe 
435 



Glu Leu Glu Leu Val Ser Pro Phe Pro 
440 445 



Glu lie Asn Trp Asn Ala Leu 
450 455 



Val Val Gly Ala Lys Gly Asn Val Met 
460 



Val Arg Tyr Lys Arg Arg Pro 
465 470 



Phe Ser 



<210> 14 
<211> 523 
<212> DNA 

'<213> Arabidopsis thaliana 
<400> 14 

gacactatag aagagctatg acgtcgcatg cacgcgtacg taagctcgga attcggctcg 60 
agcttgttca caaaaagatt acttttctta ttggtcctga agtctctgct cattttttca 120 
aagcttctga atctgatctt agtcagcagg aagtgtatca gttcaatgtc cctacttttg 180 
gtcctggagt tgttttcgat gttgattatt ctgtttcgtc aggagcagtt cggttcttca 240 
ctgaggcact tagagttaac aagttgaagg gttatgtgga tatgatggtt actgaagctg 300 
aggattactt ctctaaatgg ggagagagtg gtgaagttga tattaaggtt gagctagaga 3 60 
ggctcatcat cttgactgca agtgatgttt actgggtcga gaagttcgtg atcagctttt 420 
tgatgatgtc tctgctttgt tccatgacct tgacaatgga atgcttccca tcagtgcttc 480 
ccatcagtgt tctcttccca tatctcccaa ttccagctca ccg 523 



<210> 15 
<211> 87 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 15 

His Tyr Arg Arg Ala Met Thr Ser His Ala Arg Val Arg Lys Leu Gly 
15 10 15 

lie Arg Leu Glu Leu Val His Lys Lys lie Thr Phe Leu lie Gly Pro 
20 25 30 

Glu Val Ser Ala His Phe Phe Lys Ala Ser Glu Ser Asp Leu Ser Gin 
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35 40 45 

Gin Glu Val Tyr Gin Phe Asn Val Pro Thr Phe Gly . Pro Gly Val Val 
.50 55 6Q 

Phe Asp Val Asp Tyr Ser Val Arg Gin Glu Gin Phe Gly Ser Ser Leu 
65 70 75 - 80 

Arg His Leu Glu Leu Thr Ser 
85 

<210> 16 
<211> 1852 
<2iC2> DNA 

<213> Arabidopsis thaliana • 
<400> 16 

tcgaccccgc gtccgcggac gcgtgggatc agcttcaagc ttaagagagc ttcgaaagcg 60 

aaagcgacga tttcttctcc atcgtgagag caaatctcca gagccgtttt ctcttcttct 120 

tcttcctcct cgcgccgtct ctgaaactcc atcatcgtat caatcaaatt gcttcctcct 180 

ccaaattgaa aaacaatgga attggattcg gagaacaaat tgttgaagac gggtttggtt 24 0 

atagtggcga cacttgttat agccaaactc atcttctctt tcttcacttc tgattctaag 3 00 

aagaagcgtc ttcctcctac tcttaaagct tggcctccat tggttggaag tcttatcaaa 360 

ttcttgaaag gacctattat tatgcttaga gaggaatacc ctaagcttgg aagtgtgttt 42 0 

actgttaatc ttgttcacaa aaagattact tttcttattg gtcctgaagt ctctgctcat 4 80 

tttttcaaag cttctgaatc tgatcttagt cagcaggaag tgtatcagtt caatgtccct 540 

acttttggtc ctggagttgt tttcgatgtt gattattctg ttcgtcagga gcagtttcgg 600 

ttcttcactg aggcacttag agttaacaag ttgaagggtt atgtggatat gatggttact 660 

gaagctgagg at tact t etc taaatgggga gagagtggtg aagttgatat taaggttgag 720 

ctagagaggc tcatcatctt gactgcaagt agatgtttac tgggtcgaga agttcgtgat 780 

cagctttttg atgatgtctc tgctttgttc catgaccttg acaatggaat gcttcccatc 840 

agtgttctct tcccatatct cccaattcca gctcaccgcc gtcgtgaccg tgcccgagaa 900 

aagctttcgg agattttcgc aaaaatcatt gggtcgagaa aacgctctgg aaaaacagag 960 

aacgacatgc tgcagtgttt catcgaatca aagtacaaag atggtagaca gacaaccgaa 1020 

tctgaagtca ctggtttgct cattgctgct ctgtttgcag gacaacacac gagctctatc 1080 

acttccacct ggaccggtgc ttatctgatg cgatacaaag agtacttctc agctgctctt 1140 

gatgagcaga agaacctgat tgcgaaacat ggagacaaga tcgatcatga tatcttatcc 1200 
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gagatggatg 


ttctctaccg 


ctgcattaag 


gaagcgttga 


ggcttcaccc 


tccactcatc 


1260 


atgttaatga 


gagcctcgca 


cagtgatttc 


agcgtgacag 


ctcgggatgg 


aaaaacttac 


1320 


gatatcccaa 


agggtcacat 


cgttgcaacc 


tcccctgcat 


ttgccaaccg 


cttaccgcac 


1380 


atcttcaaag 


accccgacac 


ctacgaccca 


gaaagattct 


cccctggaag 


agaagaggac 


1440 


aaagccgcag 


gggcattctc 


gtacattgca 


ttcggagggg 


gaaggcacgg gtgccttgga 


1500 


gagccgtttg 


cttacctgca 


gatcaaagcc 


atatggagtc 


atttgttgag 


gaacttcgag 


1560 


cttgagctag 


tttcaccgtt 


ccctgagatt 


gactggaacg ctatggtg'gt 


tggagttaaa 


1620 


ggcaatgtga 


tggtgcgtta 


caagaggcgc 


cagctttctt aaagacaagt ttaaggttat 


1680 


tgcagctttg 


gatttttctc 


tctggtttct 


gctttgcttt 


tgtccctctc 


tggttttagt 


1740 


tttgttgttg 


aataattctt 


ctgtttttat 


aaactgttgt 


tactctttaa 


ttgacattta 


1800 


tttttaagct 


tcctaagttt 


gtggttcaaa 


aaaaaaaaaa 


ggcggcgtta 


ct 


1852 



<210> 17 
<211> 488 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 17 

Met Glu Leu Asp Ser Glu Asn Lys Leu Leu Lys Thr Gly Leu Val lie 
15 10 15 

Val Ala Thr Leu Val lie Ala Lys Leu lie Phe Ser Phe Phe Thr Ser 
20 25 30 

Asp Ser Lys Lys Lys Arg Leu Pro Pro Thr Leu Lys Ala Trp Pro Pro 
35 40 45 

Leu Val Gly Ser Leu lie Lys Phe Leu Lys Gly Pro He He Met Leu 
50 55 60 

Arg Glu Glu Tyr Pro Lys Leu Gly Ser Val Phe Thr Val Asn Leu Val 
65 70 75 80 

His Lys Lys He Thr Phe Leu He Gly Pro Glu Val Ser Ala His Phe 
85 90 95 

Phe Lys Ala Ser Glu Ser Asp Leu Ser Gin Gin Glu Val Tyr Gin Phe 
100 105 110 

Asn Val Pro Thr Phe Gly Pro Gly Val Val Phe Asp Val Asp Tyr Ser 
115 120 125 

Val Arg Gin Glu Gin Phe Arg Phe Phe Thr Glu Ala Leu Arg Val Asn 
130 135 140 
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Lys Leu Lys Gly Tyr Val Asp Met Met Val Thr Glu Ala Glu Asp Tyr 
145 150 155 160 

Phe- Ser Lys Trp Gly Glu Ser Gly Glu Val Asp He Lys Val Glu Leu 
165 170 175 

Glu Arg Leu He He Leu Thr Ala Ser Arg Cys Leu Leu Gly Arg Glu 
180 185 190 

Val Arg Asp Gin Leu Phe Asp Asp Val Ser Ala Leu Phe His Asp Leu 
^ 195 200 205 

AsE> Asn Gly Met Leu Pro He Ser Val Leu Phe Pro Tyr Leu Pro He 
\ 210 215 220 

Pro Ala His Arg Arg Arg Asp Arg Ala Arg Glu Lys Leu Ser Glu He 
225 230 235 240 

Phe Ala Lys He He Gly Ser Arg Lys Arg Ser Gly Lys Thr Glu Asn 
245 250 255 

Asp Met Leu Gin Cys Phe He Glu Ser Lys Tyr Lys Asp Gly Arg Gin 
260 265 270 

Thr Thr Glu Ser Glu Val Thr Gly Leu Leu He Ala Ala Leu Phe Ala 
275 280 285 

Gly Gin His Thr Ser Ser He Thr Ser Thr Trp Thr Gly Ala Tyr Leu 
290 295 300 

Met Arg Tyr Lys Glu Tyr Phe Ser Ala Ala Leu Asp Glu Gin Lys Asn 
305 310 315 320 

Leu He Ala Lys His Gly Asp Lys He Asp His Asp He Leu Ser Glu 
325 330 335 

Met Asp Val Leu Tyr Arg Cys He Lys Glu Ala Leu Arg Leu His Pro 
340 345 350 

Pro Leu He Met Leu Met Arg Ala Ser His Ser Asp Phe Ser Val Thr 
355 360 365 

Ala Arg Asp Gly Lys Thr Tyr Asp He Pro Lys Gly His He Val Ala 
370 375 380 

Thr Ser Pro Ala Phe Ala Asn Arg Leu Pro His He Phe Lys Asp Pro 
385 390 395 400 

Asp Thr Tyr Asp Pro Glu Arg Phe Ser Pro Gly Arg Glu Glu Asp Lys 
405 410 415 

Ala Ala Gly Ala Phe Ser Tyr He Ala Phe Gly Gly Gly Arg His Gly 
420 425 430 

Cys Leu Gly Glu Pro Phe Ala Tyr Leu Gin He Lys Ala He Trp Ser 
435 440 445 
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His Leu Leu Arg Asn Phe Glu Leu Glu Leu Val Ser Pro Phe Pro Glu 
450 455 460 

lie Asp Trp Asn Ala Met Val Val Gly Val Lys Gly Asn Val Met Val 
465 470 475 480 

Arg Tyr Lys Arg Arg Gin Leu Ser — 
485 

<210> 18 
<211> 1852 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 18 

tcgaccccgc gtccgcggac gcgtgggatc agcttcaagc ttaagagagc ttcgaaagcg 60 

aaagcgacga tttcttctcc atcgtgagag caaatctcca gagccgtttt ctcttcttct 120 

tcttcctcct cgcgccgtct ctgaaactcc atcatcgtat caatcaaatt gcttcctcct ISO 

bcaaattgaa aaacaatgga attggattcg gagaacaaat tgttgaagac gggtttggtt 240 

atagtggcga cacttgttat agccaaactc atcttctctt tcttcacttc tgattctaag 3 00 

aagaagcgtc ttcctcctac tcttaaagct tggcctccat tggttggaag tcttatcaaa 360 

ttcttgaaag gacctattat tatgcttaga gaggaatacc ctaagcttgg aagtgtgttt 42 0 

actgttaatc ttgttcacaa aaagattact tttcttattg gtcctgaagt ctctgctcat 4 80 

tttttcaaag cttctgaatc tgatcttagt cagcaggaag tgtatcagtt caatgtccct 54 0 

acttttggtc ctggagttgt tttcgatgtt gattattctg ttcgtcagga gcagtttcgg 600 

ttcttcactg aggcacttag agttaacaag ttgaagggtt atgtggatat gatggttact 660 

gaagctgagg attacttctc taaatgggga gagagtggtg aagttgatat taaggttgag 72 0 

ctagagaggc tcatcatctt gactgcaagt agatgtttac tgggtcgaga agttcgtgat 7 80 

cagctttttg atgatgtctc tgctttgttc catgaccttg acaatggaat gcttcccatc 840 

agtgttctct tcccatatct cccaattcca gctcaccgcc gtcgtgaccg tgcccgagaa 900 

aagctttcgg agattttcgc aaaaatcatt gggtcgagaa aacgctctgg aaaaacagag 960 

aacgacatgc tgcagtgttt catcgaatca aagtacaaag atggtagaca gacaaccgaa 1020 

tctgaagtca ctggtttgct cattgctgct ctgtttgcag gacaacacac gagctctatc 1080 

acttccacct ggaccggtgc ttatctgatg cgatacaaag agtacttctc agctgctctt 1140 

gatgagcaga agaacctgat tgcgaaacat ggagacaaga tcgatcatga tatcttatcc 12 00 

gagatggatg ttctctaccg ctgcattaag gaagcgttga ggcttcaccc tccactcatc 12 60 
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atgttaatga gagcctcgca cagtgatttc agcgtgacag ctcgggatgg aaaaacttac X320 
gatatcccaa agggtcacat cgttgcaacc tcccctgcat ttgccaaccg cttaccgcac 1380 
atcttcaaag accccgacac ctacgaccca gaaagattct cccctggaag agaagaggac .144 0 
aaagccgcag gggcattctc gtacattgca ttcggagggg gaaggcacgg gtgccttgga 1500 — 
gagccgtttg cttacctgca gatcaaagcc atatggagtc atttgttgag gaacttcgag 1560 
cttgagctag tttcaccgtt ccctgagatt gactggaacg ctatggtggt tggagttaaa 1620 

ggcaatgtga tggtgcgtta caagaggcgc cagctttctt aaagacaagt ttaaggttat 1680 

\ 

tgcagctttg gatttttctc tctggtttct gctttgcttt tgtccctctc tggttttagt 1740 

tttgttgttg aataattctt ctgtttttat aaactgttgt tactctttaa ttgacattta 1800 

tttttaagct tcctaagttt gtggttcaaa aaaaaaaaaa ggcggcgtta ct 1852 

<210> 19 
<211> 488 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 19 

Met Glu Leu Asp Ser Glu Asn Lys Leu Leu Lys Thr Gly Leu Val He 
15 10 15 

Val Ala Thr Leu Val He Ala Lys Leu He Phe Ser Phe Phe Thr Ser 
20 25 30 

Asp Ser Lys Lys Lys Arg Leu Pro Pro Thr Leu Lys Ala Trp Pro Pro 
35 40 ' 45 

Leu Val Gly Ser Leu He Lys Phe Leu Lys Gly Pro He He Met Leu 
50 55 60 

Arg Glu Glu Tyr Pro Lys Leu Gly Ser Val Phe Thr Val Asn Leu Val 
65 70 75 80 

His Lys Lys He Thr Phe Leu He Gly Pro Glu Val Ser Ala His Phe 
85 90 95 

Phe Lys Ala Ser Glu Ser Asp Leu Ser Gin Gin Glu Val Tyr Gin Phe 
100 105 110 

Asn Val Pro Thr Phe Gly Pro Gly Val Val Phe Asp Val Asp Tyr Ser 
115 120 125 

Val Arg Gin Glu Gin Phe Arg Phe Phe Thr Glu Ala Leu Arg Val Asn 
130 135 140 

Lys Leu Lys Gly Tyr Val Asp Met Met Val Thr Glu Ala Glu Asp Tyr 
145 150 155 160 
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Phe Ser Lys Trp Gly Glu Ser Gly Glu Val Asp lie liys Val Glu Leu 
165 170 175 

Glu Arg Leu He He Leu Thr Ala Ser Arg Cys Leu Leu Gly Arg Glu 
180 185 190 

Val Arg Asp Gin Leu Phe Asp Asp Val Ser Ala Leu Phe His Asp Leu 
195 200 205 

Asp Asn Gly Met Leu Pro He Ser Val Leu Phe Pro Tyr Leu Pro He 
210 215 220 

Pro Ala His Arg Arg Arg Asp Arg Ala Arg Glu Lys Leu Ser Glu He 
225 230 235 240 

Phe Ala Lys He He Gly Ser Arg Lys Arg Ser Gly Lys Thr Glu Asn 
245 250 255 

Asp Met Leu Gin Cys Phe He Glu Ser Lys Tyr Lys Asp Gly Arg Gin 
260 265 270 

Thr Thr Glu Ser Glu Val Thr Gly Leu Leu He Ala Ala Leu Phe Ala 
275 280 285 

Gly Gin His Thr Ser Ser He Thr Ser Thr Trp Thr Gly Ala Tyr Leu 
290 295 300 

Met Arg Tyr Lys Glu Tyr Phe Ser Ala Ala Leu Asp Glu Gin Lys Asn 
305 310 315 320 

Leu He Ala Lys His Gly Asp Lys He Asp His Asp He Leu Ser Glu 
325 330 335 

Met Asp Val Leu Tyr Arg Cys He Lys Glu Ala Leu Arg Leu His Pro 
340 345 350 

Pro Leu He Met Leu Met Arg Ala Ser His Ser Asp Phe Ser Val Thr 
355 360 365 

Ala Arg Asp Gly Lys Thr Tyr Asp He Pro Lys Gly His He Val Ala 
370 375 380 

Thr Ser Pro Ala Phe Ala Asn Arg Leu Pro His He Phe Lys Asp Pro 
385 390 395 400 

Asp Thr Tyr Asp Pro Glu Arg Phe Ser Pro Gly Arg Glu Glu Asp Lys 
405 410 415 

Ala Ala Gly Ala Phe Ser Tyr He Ala Phe Gly Gly Gly Arg His Gly 
420 425 430 

Cys Leu Gly Glu Pro Phe Ala Tyr Leu Gin He Lys Ala He Trp Ser 
435 440 445 

His Leu Leu Arg Asn Phe Glu Leu Glu Leu Val Ser Pro Phe Pro Glu 
450 455 460 
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lie Asp Trp Asn Ala Met Val Val Gly Val Lys Gly Asn Val Met Val 
465 470 475 480 

Arg. Tyr Lys Arg Arg Gin Leu Ser 
485 

<210> 20 
<211> 1249 
<212> DNA 

<213> Arabidopsis thaliana 

I 

<4qo> 20 

ctt:\tctccct gtgaaaaaat ggactcggtg gctctctact gcaccgctgg tctcattgcc 60 

ggcgccgtct actggttcat atgcgtccta ggtccagcag aacgaaaagg caaacgagcc 12 0 

tctgatctct ccggcggctc aatctccgca gaaaaagtca aagacaacta taaccaatac 180 

tggtctttct tccgcaaacc aaaagagatc gaatcagccg agaaagtacc tgacttcgtc 240 

gacacgttct acaatcttgt cactgatacc tacgagtggg gatggggaca atctttccat 3 00 

ttctctcctc atgtccctgg aaaatccgac aaagacgcca caagaatcca cgaagaaatg 3 60 

gccgtcgatc tcatcaaagt gaaaccggga caaaagattc ttgacgctgg ttgcggcgtg 420 

ggtgggccga tgagagccat cgcggcccat tccaaggccc aagtcactgg aatcactatc 4 80 

aacgagtacc aagtgcaacg agccaagctt cacaacaaga aagctggact tgattctctc 540 

tgcaacgtcg tttgtggtaa ctttttaaag atgccgttcg atgaaaacac gtttgacgga 600 

gcttactcga tcgaagctac gtgtcacgct cctaagctcg aagaagtata ctcggagatc 660 

ttcagagtga tgaaaccagg atctttgttc gtgtcctacg aatgggtcac cactgaaaaa 720 

tacagagacg atgacgaaga acacaaggac gtgattcaag ggatcgagag aggagacgca 780 

cttcctggac taagaagcta cgctgatata gccgtgacgg cgaagaaagt tgggtttgag 84 0 

gtagtgaagg agaaagattt ggctaaacca ccgtctaaac cgtggtggaa ccggttaaag 900 

atgggaagga ttgcttattg gagaaaccat gttgtggttg tgattctttc tgctattggg 960 

gttgctccta aaggaactgt tgatgttcat aagatgttgt ttaagactgc tgattatttg 102 0 

accagaggtg gtgagactgg aatcttctct ccgatgcata tgattctctg tagaaaacca 1080 

gagaaagctt ctgaatgaat gattgagaat acttcttcct tgttctcgtt ttcttcttct 1140 

ttctttctaa gttcatgttt ttccccttaa gaatctcttt gtccgtcgta ttaatgttat 1200 

cactttgttg tttattgtat tttttttttt caatttgcta aattactcc 1249 

<210> 21 
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<211> 1444 

<212> DNA 

<213> Nicotiana tabacum 

<400> 21 







tccuctcttg 


aaaggtgaaa 


ggttctctcc 


aagaatacag 


60 


a tf^S 4" ^1 M A. ^ 

a^aucccucc 


tcbacataga 


ttttgtgtat 


atcttgtgat 


ttigggaaaga 


aatgtcaaaa 


120 


^ ^^^^^^^^ ^» ^ 

caaGfQ9^cut: 


uugaccuggc 


atctggggtt 


ggtggcaaaa 


ttiaacaagga 


ggaagttctc 


180 


t C t Cf C t^Gf 1 1 Of 


acaagt^atga 


gaagtaccat 


ggttattatg 


gaggtgaaga 


agaagagaga 


240 


aagaatiaac^ 


atactgacat 


ggttaacaaa 


tactatgatc 


tttgcactag 


cttctacgaa 


300 


tacggctggg 


gagagtcatt 


ccattttgca 


cccaggtgga 


aaggagaatc 


actccaagag 


360 


agcattaaaa 


ggcatgagca 


ctttcttgcc 


ttgcaactgg 


gattgaaacc 


aggacaaaag 


420 


gtcttggacg 


taggatgtgg 


aattggtggg 


ccgttaagag 


aaattgctcg 


attcagctct 


480 


acatcagt:ta 


caggcctcaa 


caataatgaa 


tatcagatat 


ctaggggaca 


ggtgttgaac 


540 


cgcaaagtag 


gattggatca 


gacttgcaac 


tttgtaaagg 


gtgatttcat 


gaaaatgcca 


600 


ttccctgaca 


atagctttga 


tgcagtgtac 


gcaatagaag 


ctacctgcca 


tgcaccagat 


660 


ccattgggat 


gctataaaga 


gatttaccgg 


gtgctgaagc 


ctggtcaatg 


tttcgctgtg 


720 


tatgagtggt 


gcatgaccga 


ttcttacaac 


cccaa^aacg 


aagagcacaa 


caggatcaag 


.780 


gccgaaattg 


agctcggaaa 


tggcctccct 


gaggttagat 


tgacaacacia 


gtgcctcgaa 


840 


gcagccaaac 


aagctggttt 


tgaagttgta 


t999acaagg 


atctggctga 


tgactcacct 


900 


gttccatggt 


acttgccttt 


ggatacgagt 


cacttctcgc 


tcagtagctt 


ccgcctaaca 


960 


gcagttggca 


gacttttcac 


cagaaatctg 


gtttcggcgc 


ttgaatacgt 


gggacttgct 


1020 


cctaaaggta 


gtcaaagggt 


tcaagctttc 


ttagagaaag 


ctgcagaagg 


tcttgtcggt 


1080 


ggtgccaaga 


aagggatttt 


cacaccaatg 


tacttcttcg 


tggttcgcaa 


gcccatttca 


1140 


gactctcagt 


aatatggagt 


ttagtcactt 


agctttttgc 


tttagctagc 


aaatctgtaa 


1200 


gattcttcgc 


acagaacttt 


acacattgaa 


tatgaccgcc 


ctaatitaagg 


tgactacagt 


1260 


ttttggaggg 


cgttgtgggt 


ggagggtttc 


tttttctgtg 


ttgcttgtct 


ggcacaattt 


1320 


gatttcatgt 


cttgctattt 


ttgccattga 


tgtccttgtt 


ctaagatata 


tacctattga 


1380 


caagctcata 


aaggtgggca 


tttgctaata 


tatggtgttt 


caggtaaaaa 


aaaaaaaaaa 


1440 


aaaa 












1444 



<210> 22 
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<211> 1421 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 22 



ctctctctct 


ctctctcttg 


gtcttcctca 


ctcttaacga 


aaatggactc 


tttaacactc 


60 


ttcttcaccg 


gtgcactcgt 


cgccgtcggt 


atctactggt 


tc'ctctgcgt 


tctcggtcca 


120 


gcagagcgta 


aaggcaaacg 


agccgtagat 


ctctctggtg 


gctcaatctc 


cgccgagaaa 


180 


gtccaagaca 


actacaaaca 


gtactggtct 


ttcttccgcc 


gtccaaaaga 


aatcgaaacc 


240 


gccgagaaag 


ttccagactt 


cgtcgacaca 


ttctacaatc 


tcgtcaccga 


catatacgag 


300 


tggggatggg 


gacaatcctt 


ccacttctca 


ccatcaatcc 


ccggaaaatc 


tcacaaagac 


360 


gccacgcgcc 


tccacgaaga 


gatggcggta 


gatctgatcc 


aagtcaaacc 


tggtcaaaag 


420 


atcctagacg 


tcggatgcgg 


tgtcggcggt 


ccgatgcgag 


cgattgcatc 


tcactcgcga 


480 


gctaacgtag 


tcgggattac 


aataaacgag 


tatcaggtga 


acagagctcg 


tctccacaat 


540 


aagaaagctg 


gtctcgacgc 


gctttgcgag 


gtcgtgtgtg 


gtaacttcct 


ccagatgccg 


600 


ttcgatgaca 


acagtttcga 


cggagcttat 


tccatcgaag 


ccacgtgtca 


cgcgccgaag 


660 


ctggaagaag 


tgtacgcaga 


gatctacagg 


gtgttgaaac 


ccggatctat 


gtatgtgtcg 


720 


tacgagtggg 


titacgacgga 


gaaatttaag 


gcggaggatg 


acgaacacgt 


ggaggtaatc 


780 


caagggattg 


agagaggcga 


tgcgttacca 


gggcttaggg 


cttacgtgga 


tatagctgag 


840 


acggctaaaa 


aggttgggtt 


tgagatagtg 


aaggagaagg 


atctggcgag 


tccaccggct 


900 


gagccgtggt 


ggactaggct 


taagatgggt 


aggcttgctt 


attggaggaa 


tcacattgtg 


960 


gttcagattt 


tgtcagcggt 


tggagttgct 


cctaaaggaa 


ctgttgatgt 


tcatgagatg 


1020 


ttgtttaaga 


ctgctgattg 


tttgaccaga 


ggaggtgaaa 


ccggaatatt 


ctctccgatg 


1080 


catatgattc 


tctgcagaaa 


accggagtca 


ccggaggaga 


gttcttgaga 


aaggtagaaa 


1140 


ggaaacatca 


ccggaaaaag 


tatggagaat 


tttctcaatt 


tgtttttatt 


tttaagttaa 


1200 


atcaacttgg 


ttattgtact 


atttttgtgt 


tttaatttgg 


tttgtgtttc 


aagaattatt 


1260 


agtttttttt 


tgttttgttg 


catatgagaa 


tcttactctt 


gatttctccg 


ccgtagagcc 


1320 


ggcgagacat 


aggggattat 


tagtattttt 


aagtgtgttt 


aagattgatt 


aacaagttag 


1380 


taaaataaaa 


tgtacttagg 


tgtcgaaaaa 


aaaaggaatt 


c 




1421 



<210> 23 
<211> 1175 
<212> DNA 
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<213> Arabidopsis thaliana 
<400> 23 



cagtgtgagt 


aatttagcat 


tactactgtt 


gacttgttca 


ataaaggtaa 


agtaagatca 


60 


atccggcgca 


atcttctttc 


gttttccggc 


accgatctcg 


gtggatctcc 


gattcacatg 


12 0 


gcggcggata 


atgcttatct 


gatgcagttt 


gttgacgaaa 


cctcttttta 


caaccgaatc 


180 


gttctgagtc 


atcttttgcc 


ggcgaatcta 


tgggaaccct 


tacctcattt 


tctocagaca 


240 


tggctccgaa 


attacctcgc 


cggaacccta 


ctatacatca 


tctccggttt 


cctctggtgc 


300 


ttctacatct 


attaccgtaa 


aatcaacgtt 


taccttccca 


aagatgcaat 


tcctacaata 


360 


aaggctatgc 


gtttgcaaat 


gtttgtggca 


atgaaggcta 


tgccatggta 


cactcttctt 


420 


ccaactgtct 


ccgagagtat 


gattgaacgt 


ggttggacca 


aatgttttgc 


tagcataggc 


480 


gaattcggtt 


ggattctgta 


ttttgtttac 


atcgccatct 


atcttgtttt 


cgttgagttt 


540 


ggtatttatt 


ggatgcacag 


agagcttcat 


gacattaagc 


ctctctataa 


gtatctccat 


600 


gccacccatc 


atatctacaa 


caagcagaat 


acactctctc 


catttgccgg 


gcttgcattt 


660 


cacccagtag 


acgggatact 


tcaggctgta 


ccgcatgtga 


tagcgctgtt 


tatagtgcca 


720 


attcatttca 


caactcatat: 


aggtcttttg 


ttcatggaag 


cgatatgggc 


ggcgaacatc 


780 


catgactgca 


tccatggcaa 


catctggcca 


gtaatgggtg 


caggatacca 


tacgatacac 


840 


cacacgacat 


acaagcataa 


ctatggtcat 


tataccatat 


ggatggattg 


gatgtttggc 


900 


tctcttaggg 


atcctctctt 


agaagaagat 


gacaacaaag 


acagcttcaa 


gaaagcagag 


960 


tgaggatgcc 


cacttggggg 


ttgttcttct 


gtgttgtctt 


gtgttgttgt 


tgtccaaagt 


1020 


ttcagccttt 


cttgttcttt 


ttcttcttct 


tcttattcat 


gtgtctctct 


caacctttcc 


1080 


aattatattg 


ttacaaacat 


ttgctgtcta 


gtttaaaaca 


tgtaaatgtt 


tgatgatctt 


1140 


tccccaaaaa 


aaaaaaaact 


aaattactca 


cactg 






1175 



<210> 24 
<211> 1431 
<212> DNA 

<213> Hevea brasiliensis 
<400> 24 

atggcacgcg cctcccatga cgtgtgggac ctcgaagata cggatcccaa ctacctcatc 60 
gatgaagatc accgtctcgt tacttgccct cccgctaata .tatctactaa gactaccatt 120 
attgccgcac ctaccaaatt gcctacctcg gaacccttaa ttgcaccctt agtctcggag 180 
gaagacgaaa tgatcgtcaa ctccgtcgtg gatgggaaga taccctccta ttctctggag . 240 
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tcgaagctcg 


gggactgcaa 


acgagcggct 


gcgattcgac 


gcgaggcttt 


gcagaggatg 


300 


acaaggaggt cgctggaagg cttgccagta gaagggttcg attacgagtc gattttagga 


360 


caatgctgtg 


aaatgccagt: 


gggatacgtg 


cagattccgg 


tggggattgc 


ggggccgttg 


"420 


ttgctgaacg 


99cgggagta 


ctctgttcca 


atggcgacca 


cggagggttg 


tttggtggcg 


480 


agcactaata 


gagggtgtaa 


ggcgatttac 


ttgtcaggtg 


gggccaccag 


cgtcttgttg 


540 


saggatggca 


tgacaagagc 


gcctgttgta 


agattcgcgt 


cggcgactag 


agccgcggag 


600 


ttgaagttct 


t c 1 1 ggagga 


tcctgacaat 


tttgatacct 


tggccgtagt 


ttttaacaag 


660 


t: c c agt: aga t: 


ttgcgaggct 


ccaaggcatt 


aaatgctcaa 


ttgctggtaa 


gaatctttat 


720 


ataagattca 


gctgcagcac 


tggcgatgca 


atggggatga 


acatggtttc 


taaaggggtt 


780 


caaaacgttc 


ttgaatttct 


tcaaagtgat 


ttttctgata 


tggatgtcat 


tggaatctca 


840 


ggaaattttt 


gttcggataa 


gaagcctgct 


gctgtaaatt 


ggattgaagg 


acgtggcaaa 


900 


tcagttgttt 


gtgaggcaat 


tatcaaggaa 


gaggtggtga 


agaaggtgtt 


gaaaaccaat 


960 


gtggcctccc 


tagtggagct 


taacatgctc 


aagaatcttg 


ctggttctgc 


tgttgctggt 


1020 


gctttgggtg 


gatttaatgc 


ccatgcaggc 


aacatcgtat 


ctgcaatctt 


tattgccact 


1080 


ggccaggatc 


cagcacagaa 


tgttgagagt 


tctcattgca 


ttaccatgat 


ggaagctgtc 


1140 


aatgatggaa 


aggatctcca 


tatctctgtg 


accatgccct 


ccattgaggt 


gggtacagtc 


1200 


ggaggtggaa 


ctcaacttgc 


atctcagtct 


gcttgtctca 


atttgcttgg 


ggtgaagggt 


1260 


gcaaacaaag 

1 


agtcgccagg 


atcaaactca 


aggctccttg 


ctgccatcgt 


agctggttca 


1320 


gttttggctg 


gtgagctctc 


cttgatgtct 


gccattgcag 


ctgggcagct 


tgtcaagagt 


1380 


cacatgaagt 


acaacagagc 


cagcaaagat 


atgtctaaag 


ctgcatctta 


g 


1431 



<210> 25 

<211> 476 

<212> PRT 

<213> Hevea brasiliensis 

<400> 25 

Met Ala Arg Ala Ser His Asp Val Trp Asp Leu Glu Asp Thr Asp Pro 
1 5 10 '15 

Asn Tyr Leu He Asp Glu Asp His Arg Leu Val Thr Cys Pro Pro Ala 
20 25 30 

Asn He Ser Thr Lys Thr Thr He He Ala Ala Pro Thr Lys Leu Pro 
35 40 45 
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Thr Ser Glu Pro Leu lie Ala Pro Leu Val Ser Glu Glu Asp Glu Met 
50 55 60 . 

He Val Asn Ser Val Val Asp Gly Lys He Pro Ser Tyr Ser Leu Glu 
^5 70 75 80 

Ser Lys Leu Gly Asp Cys Lys Arg Ala Ala Ala He Arg Arg Glu Ala 
85 90 95 

Leu Gin Arg Met Thr Arg Arg Ser Leu Glu Gly Leu Pro Val Glu Gly 
100 105 110 

Phe Asp Tyr Glu Ser He Leu Gly Gin Cys Cys Glu Met Pro Val Gly 
115 120 125 

Tyr Val Gin He Pro Val Gly He Ala Gly Pro Leu Leu Leu Asn Gly 
130 135 140 

Arg Glu Tyr Ser Val Pro Met Ala Thr Thr Glu Gly Cys Leu Val Ala 
145 150 155 160 

Ser Thr Asn Arg Gly Cys Lys Ala He Tyr Leu Ser Gly Gly Ala Thr 

165 170 ' 175 

Ser Val Leu Leu Lys Asp Gly Met Thr Arg Ala Pro Val Val Arg Phe 
180 185 190 

Ala Ser Ala Thr Arg Ala Ala Glu Leu Lys Phe Phe Leu Glu Asp Pro 
195 200 205 

Asp Asn Phe Asp Thr Leu Ala Val Val Phe Asn Lys Ser Ser Arg Phe 
210 215 220 

Ala Arg Leu Gin Gly He Lys Cys Ser He Ala Gly Lys Asn Leu Tyr 
225 230 235 240 

He Arg Phe Ser Cys Ser Thr Gly Asp Ala Met Gly Met Asn Met Val 
245 250 255 

Ser Lys Gly Val Gin Asn Val Leu Glu Phe Leu Gin Ser Asp Phe Ser 
260 265 270 

Asp Met Asp Val He Gly He Ser Gly Asn Phe Cys Ser Asp Lys Lys 
275 280 285 

Pro Ala Ala Val Asn Trp He Glu Gly Arg Gly Lys Ser Val Val Cys 
290 295 300 

Glu Ala He He Lys Glu Glu Val Val Lys Lys Val Leu Lys Thr Asn 
305 310 315 320 

Val Ala Ser Leu Val Glu Leu Asn Met Leu Lys Asn Leu Ala Gly Ser 
325 330 335 

Ala Val Ala Gly Ala Leu Gly Gly Phe Asn Ala His Ala Gly Asn He 
340 345 350 



BNSOOCID: <WO 02061072A2_I_> 



wo 02/061072 



PCT/IJS02/00255 



28 



Val Ser Ala lie Phe He Ala Thr Gly Gin Asp Pro Ala Gin Asn Val 
355 360 .365 

Glu. Ser Ser His Cys He Thr Met Met Glu Ala Val Asn Asp Gly Lys 
370 375 380 

Asp Leu His He Ser Val Thr Met Pro Ser He Glu Val Gly Thr Val 
385 390 395 400 

Gly Gly Gly Thr Gin Leu Ala Ser Gin Ser Ala Cys Leu Asn Leu Leu 
1 405 410 415 

Gl>\ Val Lys Gly Ala Asn Lys Glu Ser Pro Gly Ser Asn Ser Arg Leu 
\ 420 425 430 

Leu Ala Ala He Val Ala Gly Ser Val Leu Ala Gly Glu Leu Ser Leu 
435 440 445 

Met Ser Ala He Ala Ala Gly Gin Leu Val Lys Ser His Met Lys Tyr 
450 455 460 

Asn Arg Ala Ser Lys Asp Met Ser Lys Ala Ala Ser 
465 470 475 

<210> 26 

<211> 1431 

<212> DNA 

<213> Hevea brasiliensis 

<400> 26 



atggcacgcg 


cctcccatga 


cgtgtgggac 


ctcgaagata 


cggatcccaa 


ctacctcatc 


60 


gatgaagatc 


accgtctcgt 


tacttgccct 


cccgctaata 


tatctactaa 


gactaccatt 


120 


attgccgcac 


ctaccaaatt 


gcctacctcg 


gaacccttaa 


ttgcaccctt 


agtctcggag 


180 


gaagacgaaa 


tgatcgtcaa 


ctccgtcgtg 


gatgggaaga 


taccctccta 


ttctctggag 


240 


tcgaagctcg 


gggactgcaa 


acgagcggct 


gcgattcgac 


gcgaggcttt 


gcagaggatg 


300 


acaaggaggt 


cgctggaagg 


cttgccagta 


gaagggttcg 


attacgagtc 


gattttagga 


360 


caatgctgtg 


aaatgccagt 


gggatacgtg 


cagattccgg 


tggggattgc 


g99gc:cgttg 


420 


ttgctgaacg 


ggcgggagta 


ctctgttcca 


atggcgacca 


cggagggttg 


tttggtggcg 


480 


agcactaata 


gagggtgtaa 


ggcgatttac 


ttgtcaggtg 


gggccaccag 


cgtcttgttg 


540 


aaggatggca 


tgacaagagc 


gcctgttgta 


agattcgcgt 


cggcgactag 


agccgcggag 


600 


ttgaagttct 


tcttggagga 


tcctgacaat 


tttgatacct 


tggccgtagt 


ttttaacaag 


660 


tccagtagat 


ttgcgaggct 


ccaaggcatt 


aaatgctcaa 


ttgctggtaa 


gaatctttat 


720 


ataagattca 


gctgcagcac 


tggcgatgca 


atggggatga 


acatggtttc 


taaaggggtt 


780 
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caaaacgttc ttgaatttct tcaaagtgat ttttctgata tggatgtcat tggaatctca 840 

ggaaattttt gttcggataa gaagcctgct gctgtaaatt ggattgaagg acgtggcaaa 900 

tcagttgttt gtgaggcaat tatcaaggaa gaggtggtga agaaggtgtt gaaaaccaat 960 

gtggcctccc tagtggagct taacatgctc aagaatcttg -etggttctgc tgttgctggt 1020 

gctttgggtg gatttaatgc ccatgcaggc aacatcgtat ctgcaatctt tattgccact 1080 

ggccaggatc cagcacagaa tgttgagagt tctcattgca ttaccatgat ggaagctgtc 114 0 

aatgatggaa aggatctcca tatctctgtg accatgccct ccattgaggt gggtacagtc 1200 

9gaggtggaa ctcaacttgc atctcagtct gcttgtctca atttgcttgg ggtgaagggt 1260 

gcaaacaaag agtcgccagg atcaaactca aggctccttg ctgccatcgt agctggttca 1320 

gttttggctg gtgagctctc cttgatgtct gccattgcag ctgggcagct tgtcaagagt 13 80 

cacatgaagt acaacagatc cgccaaagat atgtctaaag ctgcatctta g 1431 

<210> 27 
<211> 476 
<212> PRT 

<213> Hevea brasiliensis 
<400> 27 

Met Ala Arg Ala Ser His Asp Val Trp Asp Leu Glu Asp Thr Asp Pro 
15 10 15 

Asn Tyr Leu lie Asp Glu Asp His Arg Leu Val Thr Cys Pro Pro Ala 
20 25 30 

Asn lie Ser Thr Lys Thr Thr lie lie Ala Ala Pro Thr Lys Leu Pro 
35 40 45 

Thr Ser Glu Pro Leu He Ala Pro Leu Val Ser Glu Glu Asp Glu Met 
50 55 60 

He Val Asn Ser Val Val Asp Gly Lys He Pro Ser Tyr Ser Leu Glu 
65 70 75 80 

Ser Lys Leu Gly Asp Cys Lys Arg Ala Ala Ala He Arg Arg Glu Ala 
85 90 95 

Leu Gin Arg Met Thr Arg Arg Ser Leu Glu Gly Leu Pro Val Glu Gly 
100 105 110 

Phe Asp Tyr Glu Ser He Leu Gly Gin Cys Cys Glu Met Pro Val Gly 
115 120 125 

Tyr Val Gin He Pro Val Gly He Ala Gly Pro Leu Leu Leu Asn Gly 
130 135 140 
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Arg *Glu Tyr Ser Val Pro Met Ala Thr Thr Glu Gly Cys Leu Val Ala 
145 150 155 160 

Ser Thr Asn Arg Gly Cys Lys Ala lie Tyr Leu Ser Gly Gly Ala Thr 
165 170 175 

Ser Val Leu Leu Lys Asp Gly Met Thr Arg Ala Pro Val Val Arg Phe 
180 185 190 

Ala Ser Ala Thr Arg Ala Ala Glu Leu Lys Phe Phe Leu Glu Asp Pro 
195 200 205 

i 

Asp Asn Phe Asp Thr Leu Ala Val Val Phe Asn Lys Ser Ser Arg Phe 
\ 210 215 220 

Ala Arg Leu Gin Gly He Lys Cys Ser lie Ala Gly Lys Asn Leu Tyr 
225 230 235 240 

lie Arg Phe Ser Cys Ser Thr Gly Asp Ala Met Gly Met Asn Met Val 
245 250 255 

Ser Lys Gly Val Gin Asn Val Leu Glu Phe Leu Gin Ser Asp Phe Ser 
260 265 270 

Asp Met Asp Val lie Gly lie Ser Gly Asn Phe Cys Ser Asp Lys Lys 
275 280 285 

Pro Ala Ala Val Asn Trp lie Glu Gly Arg Gly Lys Ser Val Val Cys 
290 295 300 

Glu Ala lie lie Lys Glu Glu Val Val Lys Lys Val Leu Lys Thr Asn 
305 310 315 320 

Val Ala Ser Leu Val Glu Leu Asn Met Leu Lys Asn Leu Ala Gly Ser 
325 330 335 

Ala Val Ala Gly Ala Leu Gly Gly Phe Asn Ala His Ala Gly Asn He 
340 345 350 

Val Ser Ala He Phe He Ala Thr Gly Gin Asp Pro Ala Gin Asn Val 
355 360 365 

Glu Ser Ser His Cys He Thr Met Met Glu Ala Val Asn Asp Gly Lys 
370 375 380 

Asp Leu His He Ser Val Thr Met Pro Ser He Glu Val Gly Thr Val 
385 390 395 400 

Gly Gly Gly Thr Gin Leu Ala Ser Gin Ser Ala Cys Leu Asn Leu Leu 
405 410 415 

Gly Val Lys Gly Ala Asn Lys Glu Ser Pro Gly Ser Asn Ser Arg Leu 
420 425 430 

Leu Ala Ala He Val Ala Gly Ser Val Leu Ala Gly Glu Leu Ser Leu 
435 440 445 
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Met Ser Ala lie Ala Ala Gly Gin Leu Val Lys Ser His Met Lys Tyr 
450 455 460 

Asn Arg Ser Ala Lys Asp Met Ser Lys Ala Ala Ser 
465 470 475 

<210> 28 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<221> misc feature 



<222> 


(1) . . (22) 


<223> 


Primer 


<400> 


28 


gagatctgaa ccctaacgag ag 


<2±0> 


29 


<211> 


26 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<221> 


misc feature 


<222> 


(1>..(26> 


<223> 


Primer 



22 



<400> 29 

ggagctctta agaaaaggga cgacgc 26 



<210> 


30 


<211> 


26 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<221> 


misc feature 


<222> 


(1) . . (26) 


<223> 


Primer 


<400> 


30 



gtctctgaat cagaaatcct tctatc 2 6 

<210> 31 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 
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<221> misc_f eature 
<222> (1) . . (25) 
<223> Primer 



<400> 31 

catgtcaaat ttcactgctt catcc 25 

<210> 32 

<211> 31 

<212> DNA 

<21l3> Artificial sequence 

\ 

<226> 

<221> misc_feature 

<222> (1) . . (31) 

<223> Primer 



<400> 32 

gagatctcca cagatttaaa gaaccctaac g 31 



<210> 


33 


<211> 


31 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<221>. 


misc feature 


<222> 


(1) - . (31) 


<223> 


Primer 


<400> 


33 



ggagctcggt ttttaagaaa agggacgacg c 31 
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constructs or vectors containing DNA sequences encoding a 
3-hydroxy-3-methylglutaryl-Coenzy A reductase (HM6R) and a 
so-called steroid pathway enzyme; cells, plant parts and 
whole plants resulting from said method and oils extracted 
from said plant parts. 



2. Claims: 55 (completely), 57 - 62 (all partially) 

relating to SEQ ID N0s:4,6,8 an Arabidopsis squalene 
epoxidase, constructs, vectors and host cells containing 
said SEQ ID NOs and methods of producing said squalene 
epxoidase by culturing said host cells. 



3. Claims: 56 (completely), 57 - 62 (all partially) 

relating to SEQ ID N0s:12,14,17 an Arabidopsis obtusifoliol 
C14-alpha demethylase, constructs, vectors and host cells 
containing said SEQ ID NOs and methods of producing said 
obtusifoliol C14-alpha demethylase by culturing said host 
cells. 



4. Claims: 69 - 77 

relating to a method of increasing steroid pathway products 
and tocopherols by transforming host cells with vectors 
containing DNA sequences encoding a HGNR, a so-called 
steroid pathway enzyme and a so-called tocopherol synthesis 
pathway enzyme; plants resulting form said method and oil 
extracted from the seeds of said plants. 
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